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(57) Abstract: The present invention provides a method of identifying a human subject having increased or decreased sensitivity 
Q to warfarin, comprising detecting in the subject the presence of a single nucleotide polymorphism in the VKOR gene, wherein the 
ST single nucleotide polymorphism is correlated with increased or decreased sensitivity to warfarin, thereby identifying the subject 

having increased or decreased sensitivity to warfarin. 
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METHODS AND COMPOSITIONS FOR THE CORRELATION OF SINGLE 
NUCLEOTIDE POLYMORPHISMS IN THE VITAMIN K EPOXIDE REDUCTASE 

GENE AND WARFARIN DOSAGE 

5 

Statement of Priority 

The present application claims the benefit, under 35 U.S.C. § 1 19(e), of U.S. 
Provisional Application Serial No. 60/505,527, filed September 23, 2003, the entire 
contents of which are incorporated by reference herein. 

10 

Government Support 

The present invention was made, in part, with the support of grant numbers 
5P01 HL06350-42 and 5-R01 HL48318 from the National Institutes of Health. The 
United States Government has certain rights to this invention. 

15 

Field of the Invention 

The present invention concerns isolated nucleic acids, host cells containing 
the same, and methods of use thereof, as well as methods and compositions 
directed to identification of single nucleotide polymorphisms (SNPs) in the Vitamin K 
20 epoxide reductase (VKOR) gene and their correlation with sensitivity to warfarin. 

Background of the Invention 

The function of numerous proteins requires the modification of multiple 
glutamic acid residues to y-carboxyglutamate. Among these vitamin K-dependent 

25 (VKD) coagulation proteins, FIX (Christmas factor), FVII, and prothrombin are the 
best known. The observation that a knock-out of the gene for matrix Gla protein 
results in calcification of the mouse's arteries (Luo et al. (1997) "Spontaneous 
calcification of arteries and cartilage in mice lacking matrix GLA protein" Nature 
386:78-81) emphasizes the importance of the vitamin K cycle for proteins with 

30 functions other than coagulation. Moreover, Gas6 and other Gla proteins of 
unknown function are expressed in neural tissue and warfarin exposure in utero 
results in mental retardation and facial abnormalities. This is consistent with the 
observation that the expression of VKD carboxylase, the enzyme that accomplishes 
the Gla modification, is temporally regulated in a tissue-specific manner with high 
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expression in the nervous system during early embryonic stages. Concomitant with 
carboxylation, reduced vitamin K, a co-substrate of the reaction, is converted to 
vitamin K epoxide. Because the amount of vitamin K in the human diet is limited, 
vitamin K epoxide must be converted back to vitamin K by vitamin K epoxide 
5 reductase (VKOR) to prevent its depletion. Warfarin, the most widely used 

anticoagulation drug, targets VKOR and prevents the regeneration of vitamin K. The 
consequence is a decrease in the concentration of reduced vitamin K, which results 
in a reduced rate of carboxylation by the y-glutamyl carboxylase and in the 
production of undercarboxylated vitamin K-dependent proteins. 

10 In the United States alone, warfarin is prescribed to more than one million 

patients per year and in Holland, it has been reported that approximately 2% of the 
population is on long term warfarin therapy. Because the dose of warfarin required 
for a therapeutic level of anticoagulation varies greatly between patients, the 
utilization of warfarin is accompanied by a significant risk of side effects. For 

15 example, it has been reported that following initiation of warfarin therapy, major 
bleeding episodes occurred in 1-2% of patients and death occurred in 0.1-0.7 % of 
patients. In spite of the dangers, it has been estimated that warfarin use can prevent 
20 strokes per induced bleeding episode and is probably underutilized because of 
the fear of induced bleeding. 

20 The present invention overcomes previous shortcomings in the art by 

providing methods and compositions for correlating single nucleotide polymorphisms 
in a subject with an increased or decreased sensitivity to warfarin, thereby allowing 
for more accurate and rapid determination of therapeutic and maintenance doses of 
warfarin at reduced risk to the subject. 

25 

Summary of the invention 

The present invention provides a method of identifying a human subject 
having increased or decreased sensitivity to warfarin, comprising detecting in the 
subject the presence of a single nucleotide polymorphism in the VKOR gene, 
30 wherein the single nucleotide polymorphism is correlated with increased or 
decreased sensitivity to warfarin, thereby identifying the subject having increased or 
decreased sensitivity to warfarin. 

Additionally provided is a method of identifying a human subject having 
increased or decreased sensitivity to warfarin, comprising: a) correlating the 
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presence of a single nucleotide polymorphism in the VKOR gene with increased or 
decreased sensitivity to warfarin; and b) detecting the single nucleotide 
polymorphism of step (a) in the subject, thereby identifying a subject having 
increased or decreased sensitivity to warfarin. 
5 In a further embodiment, the present invention provides a method of 

identifying a single nucleotide polymorphism in the VKOR gene correlated with 
increased or decreased sensitivity to warfarin, comprising: 

a) identifying a subject having increased or decreased sensitivity to warfarin; 

b) detecting in the subject the presence of a single nucleotide polymorphism 
10 in the VKOR gene; and 

c) correlating the presence of the single nucleotide polymorphism of step (b) 
with the increased or decreased sensitivity to warfarin in the subject, thereby 
identifying a single nucleotide polymorphism in the VKOR gene correlated with 
increased or decreased sensitivity to warfarin. 

15 In addition, the present invention provides a method of correlating a single 

nucleotide polymorphism in the VKOR gene of a subject with increased or decreased 
sensitivity to warfarin, comprising: a) identifying a subject having increased or 
decreased sensitivity to warfarin; b) determining the nucleotide sequence of the 
VKOR gene of the subject of (a); c) comparing the nucleotide sequence of step (b) 

20 with the wild type nucleotide sequence of the VKOR gene; d) detecting a single 
nucleotide polymorphism in the nucleotide sequence of (b); and e) correlating the 
single nucleotide polymorphism of (d) with increased or decreased sensitivity to 
warfarin in the subject of (a). 

A further aspect of the present invention is an isolated nucleic acid encoding 

25 vitamin K epoxide reductase (VKOR), particularly mammalian (e.g., human, ovine, 
bovine, monkey, etc.) VKOR. Examples include (a) nucleic acids as disclosed 
herein, such as isolated nucleic acids having the nucleotide sequence as set forth in 
SEQ ID NO: 8 or SEQ ID NO: 9; (b) nucleic acids that hybridize to isolated nucleic 
acids of (a) above or the complement thereof (e.g., under stringent conditions), 

30 and/or have substantial sequence identity to nucleic acids of (a) above (e.g., are 80, 
85, 90 95 or 99% identical to nucleic acids of (a) above), and encode a VKOR; and 
(c) nucleic acids that differ from the nucleic acids of (a) or (b) above due to the 
degeneracy of the genetic code, but code for a VKOR encoded by a nucleic acid of 
(a) or (b) above. 
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The term "stringent 11 as used here refers to hybridization conditions that are 
commonly understood in the art to define the commodities of the hybridization 
procedure. Stringency conditions can be low, high or medium, as those terms are 
commonly know in the art and well recognized by one of ordinary skill. High 
5 stringency hybridization conditions that will permit homologous nucleotide sequences 
to hybridize to a nucleotide sequence as given herein are well known in the art. As 
one example, hybridization of such sequences to the nucleic acid molecules 
disclosed herein can be carried out in 25% formamide, 5X SSC, 5X Denhardt's 
solution and 5% dextran sulfate at 42°C, with wash conditions of 25% formamide, 5X 

10 SSC and 0.1% SDS at 42°G, to allow hybridization of sequences of about 60% 
homology. Another example includes hybridization conditions of 6X SSC, 0.1% SDS 
at about 45°C, followed by wash conditions of 0.2X SSC, 0.1% SDS at 50-65°C. 
Another example of stringent conditions is represented by a wash stringency of 0.3 
M NaCI, 0.03M sodium citrate, 0.1% SDS at 6070°C using a standard hybridization 

15 assay (see SAMBROOK et al., EDS., MOLECULAR CLONING: A LABORATORY 
MANUAL 2d ed. (Cold Spring Harbor, NY 1989, the entire contents of which are 
incorporated by reference herein). In various embodiments, stringent conditions can 
include, for example, highly stringent (i.e., high stringency) conditions (e.g., 
hybridization to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate 

20 (SDS), 1 mM EDTA at 65°C., and washing in 0.1xSSC/0.1% SDS at 68°C), and/or 
moderately stringent (i.e., medium stringency) conditions (e.g., washing in 
0.2xSSC/0.1% SDS at42°C.). 

An additional aspect of the present invention is a recombinant nucleic acid 
comprising a nucleic acid encoding vitamin K epoxide reductase as described herein 

25 operatively associated with a heterologous promoter. 

A further aspect of the present invention is a cell that contains and expresses 
a recombinant nucleic acid as described above. Suitable cells include plant, animal, 
mammal, insect, yeast and bacterial cells. 

A further aspect of the present invention is an oligonucleotide that hybridizes 

30 to an isolated nucleic acid encoding VKOR as described herein. 

A further aspect of the present invention is isolated and purified VKOR (e.g., 
VKOR purified to homogeneity) encoded by a nucleic acid as described herein. For 
example, the VKOR of this invention can comprise the amino acid sequence as set 
forth in SEQ ID NO: 10. 



4 



WO 2005/030039 PCT/US2004/031481 

A further aspect of the present invention is a method of making a vitamin K 
dependent protein which comprises culturing a host cell that expresses a nucleic 
acid encoding a vitamin K dependent protein in the presence of vitamin K and 
produces a vitamin K dependent protein, and then harvesting the vitamin K 
5 dependent protein from the culture, the host cell containing and expressing a 

heterologous nucleic acid encoding vitamin K dependent carboxylase, and the host 
cell further containing and expressing a heterologous nucleic acid encoding vitamin 
K epoxide reductase (VKOR) and producing VKOR as described herein. 

10 Brief Description of the Drawings 

Figures 1A-D Comparisons of warfarin dosages in wild type, heterozygous 
and homozygous subjects for SNPs vk 2581, vk3294 and vk4769, as well as a 
comparison of warfarin dosages in wild type and heterozygous subjects for P450 
2Y9. 

1 5 Figure 2. For each of the 1 3 siRNA pools, three T7 flasks containing A549 

cells were transfected and VKOR activity determined after 72 h. The VKOR assay 
used 25 pM vitamin K epoxide. One siRNA pool specific for gene gi: 131 24769 
reduced VKOR activity by 64%-70% in eight repetitions. 

Figure 3. Time course of inhibition of VKOR activity by the siRNA pool 

20 specific for gi: 1 31 24769 in A549 cells. VKOR activity decreased continuously during 
this time period while the level of its mRNA decreased rapidly to about 20% of 
normal. 25 pM vitamin K epoxide was used for this assay. The siRNA did not affect 
the activity of VKD carboxylase or the level of lamin A/C mRNA. 

Figure 4. VKOR activity was detected when mGC_1 1276 was expressed in 

25 Sf9 insect cells. -1x1 0 6 cells were used in this assay. Reactions were performed 
using 32 pM KO at 30°C for 30 minutes in Buffer D. Blank Sf9 cells served as a 
negative control and A549 cells as a reference. 

Figure 5. Inhibition of VKOR by warfarin. Reactions were performed using 1.6 
mg microsomal proteins made from VKOR_Sf9 cells, 60 pM KO, and various 

30 concentration of warfarin at 30°C for 1 5 minutes in Buffer D. 

Detailed Description of the Invention 

As used herein, w a," "an" or "the" can mean one or more than one. For 
example, M a n cell can mean a single cell or a multiplicity of cells. 
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The present invention is explained in greater detail below. This description is 
not intended to be a detailed catalog of all the different ways in which the invention 
may be implemented, or all the features that may be added to the instant invention. 
For example, features illustrated with respect to one embodiment may be 
5 incorporated into other embodiments, and features illustrated with respect to a 

particular embodiment may be deleted from that embodiment In addition, numerous 
variations and additions to the various embodiments suggested herein will be 
apparent to those skilled in the art in light of the instant disclosure which do not 
depart from the instant invention. Hence, the following specification is intended to 
-10 ~ illustrate some particular embodiments of the invention, and not to exhaustively 
specify all permutations, combinations and variations thereof. 

The "Sequence Listing" attached hereto forms a part of the instant 
specification as if fully set forth herein. 

The present invention may be carried out based on the instant disclosure and 
15 further utilizing methods, components and features known in the art, including but 
not limited to those described in US Patent No. 5,268,275 to Stafford and Wu and 
US Patent No. 6,531,298 to Stafford and Chang, the disclosures of which are 
incorporated by reference herein in their entirety as if fully set forth herein. 

As used herein, "nucleic acids" encompass both RNA and DNA, including 
20 cDNA, genomic DNA, synthetic (e.g., chemically synthesized) DNA and chimeras of 
RNA and DNA. The nucleic acid may be double-stranded or single-stranded. Where 
single-stranded, the nucleic acid may be a sense strand or an antisense strand. The 
nucleic acid may be synthesized using oligonucleotide analogs or derivatives (e.g., 
inosine or phosphorothioate nucleotides). Such oligonucleotides can be used, for 
25 example, to prepare nucleic acids that have altered base-pairing abilities or 
increased resistance to nucleases. 

An "isolated nucleic acid" is a DNA or RNA that is not immediately contiguous 
with both of the coding sequences with which it is immediately contiguous (one on 
the 5' end and one on the 3' end) in the naturally occurring genome of the organism 
30 from which it is derived. Thus, in one embodiment, an isolated nucleic acid includes 
some or all of the 5 1 non-coding (e.g., promoter) sequences that are immediately 
contiguous to the coding sequence. The term therefore includes, for example, a 
recombinant DNA that is incorporated into a vector, into an autonomously replicating 
plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which 
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exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced 
by PCR or restriction endonuclease treatment), independent of other sequences. It 
also includes a recombinant DNA that is part of a hybrid gene encoding an additional 
polypeptide sequence. 
5 The term "isolated" can refer to a nucleic acid or polypeptide that is 

substantially free of cellular material, viral material, or culture medium (when 
produced by recombinant DNA techniques), or chemical precursors or other 
chemicals (when chemically synthesized). Moreover, an "isolated nucleic acid 
fragment" is a nucleic acid fragment that is not naturally occurring as a fragment and 
- 10 would not be found in the natural state. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 
six nucleotides to about 100 nucleotides, for example, about 15 to 30 nucleotides, or 
about 20 to 25 nucleotides, which can be used, for example, as a primer in a PCR 
amplification or as a probe in a hybridization assay or in a microarray. 
15 Oligonucleotides may be natural or synthetic, e.g., DNA, RNA, modified backbones, 
etc. 

Where a particular nucleotide sequence is said to have a specific percent 
identity to a reference nucleotide sequence, the percent identity is relative to the 
reference nucleotide sequence. For example, a nucleotide sequence that is 50%, 

20 75%, 85%, 90%, 95% or 99% identical to a reference nucleotide sequence that is 
100 bases long can have 50, 75, 85, 90, 95 or 99 bases that are completely identical 
to a 50, 75, 85, 90, 95 or 99 nucleotide sequence of the reference nucleotide 
sequence. The nucleotide sequence can also be a 100 base long nucleotide 
sequence that is 50%, 75%, 85%, 90%, 95% or 99% identical to the reference 

25 nucleotide sequence over its entire length. Of course, there are other nucleotide 
sequences that will also meet the same criteria. 

A nucleic acid sequence that is "substantially identical" to a VKOR nucleotide 
sequence is at least 80%, 85% 90%, 95% or 99% identical to the nucleotide 
sequence of SEQ ID NO:8 or 9. For purposes of comparison of nucleic acids, the 

30 length of the reference nucleic acid sequence will generally be at least 40 

nucleotides, e.g., at least 60 nucleotides or more nucleotides. Sequence identity can 
be measured using sequence analysis software (e.g., Sequence Analysis Software 
Package of the Genetics Computer Group, University of Wisconsin Biotechnology 
Center, 1710 University Avenue, Madison, Wis. 53705). 
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As is known in the art, a number of different programs can be used to identify 
whether a nucleic acid or amino acid has sequence identity or similarity to a known 
sequence. Sequence identity or similarity may be determined using standard 
techniques known in the art, including, but not limited to, the local sequence identity 
5 algorithm of Smith & Waterman, Adv. Appl. Math. 2, 482 (1981), by the sequence 
identity alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48,443 (1970), by 
the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. ScL USA 
85,2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 

10 Genetics Gomputer Group, 575 Science Drive, Madison, Wl), the Best Fit sequence 
program described by Devereux et a/., Nucl. Acid Res. 12, 387-395 (1984), 
preferably using the default settings, or by inspection. 

An example of a useful algorithm is PILEUP. PILEUP creates a multiple 
sequence alignment from a group of related sequences using progressive, pairwise 

15 alignments. It can also plot a tree showing the clustering relationships used to 
create the alignment. PILEUP uses a simplification of the progressive alignment 
method of Feng & Doolittle, J. MoL Evol. 35, 351-360 (1987); the method is similar to 
that described by Higgins & Sharp, CABIOS 5, 151-153 (1989). 

Another example of a useful algorithm is the BLAST algorithm, described in 

20 Altschul et a/., J. Mol. Biol. 215, 403-410, (1990) and Kariin et a/., Proc. Natl. Acad. 
Sci. USA 90, 5873-5787 (1993). A particularly useful BLAST program is the WU- 
BLAST-2 program that was obtained from Altschul et a/., Methods in Enzymology, 
266, 460-480 (1996). WU-BLAST-2 uses several search parameters, which are 
preferably set to the default values. The parameters are dynamic values and are 

25 established by the program itself depending upon the composition of the particular 
sequence and composition of the particular database against which the sequence of 
interest is being searched; however, the values may be adjusted to increase 
sensitivity. An additional useful algorithm is gapped BLAST as reported by Altschul 
etai Nucleic Acids Res. 25, 3389-3402. 

30 The CLUSTAL program can also be used to determine sequence similarity. 

This algorithm is described by Higgins et al. (1988) Gene 73:237; Higgins et al. 
(1989) CABIOS 5:151-153; Corpet et al. (1988) Nucleic Acids Res. 16: 10881-90; 
Huang etal. (1992) CABIOS 8: 155-65; and Pearson etal. (1994) Meth. Mol. Biol. 
24: 307-331. 
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In addition, for sequences that contain either more or fewer nucleotides than 
the nucleic acids disclosed herein, it is understood that in one embodiment, the 
percentage of sequence identity will be determined based on the number of identical 
nucleotides in relation to the total number of nucleotide bases. Thus, for example, 

5 sequence identity of sequences shorter than a sequence specifically disclosed 
herein will be determined using the number of nucleotide bases in the shorter 
sequence, in one embodiment. In percent identity calculations, relative weight is not 
assigned to various manifestations of sequence variation, such as, insertions, 
deletions, substitutions, efc. 
. 1 o The VKOR polypeptides of the invention include, but are not limited to, 

recombinant polypeptides, synthetic peptides and natural polypeptides. The 
invention also encompasses nucleic acid sequences that encode forms of VKOR 
polypeptides in which naturally occurring amino acid sequences are altered or 
deleted. Preferred nucleic acids encode polypeptides that are soluble under normal 

15 physiological conditions. Also within the invention are nucleic acids encoding fusion 
proteins in which all or a portion of VKOR is fused to an unrelated polypeptide (e.g., 
a marker polypeptide or a fusion partner) to create a fusion protein. For example, 
the polypeptide can be fused to a hexa-histidine tag to facilitate purification of 
bacterially expressed polypeptides, or to a hemagglutinin tag to facilitate purification 

20 of polypeptides expressed in eukaryotic cells, or to an HPC4 tag to facilitate 

purification of polypeptides by affinity chromatography or immunoprecipitation. The 
invention also includes isolated polypeptides (and the nucleic acids that encode 
these polypeptides) that include a first portion and a second portion; the first portion 
includes, e.g., all or a portion of a VKOR polypeptide, and the second portion 

25 includes, e.g., a detectable marker. 

The fusion partner can be, for example, a polypeptide that facilitates 
secretion, e.g., a secretory sequence. Such a fused polypeptide is typically referred 
to as a preprotein. The secretory sequence can be cleaved by the cell to form the 
mature protein. Also within the invention are nucleic acids that encode VKOR fused 

30 to a polypeptide sequence to produce an inactive preprotein. Preproteins can be 
converted into the active form of the protein by removal of the inactivating sequence. 

The invention also includes nucleic acids that hybridize, e.g., under stringent 
hybridization conditions (as defined herein) to all or a portion of the nucleotide 
sequence of SEQ ID NOS: 1-6, 8 or 9 or their complements. In particular 

9 
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embodiments, the hybridizing portion of the hybridizing nucleic acid is typically at 
least 15 (e.g., 20, 30, or 50) nucleotides in length. The hybridizing portion of the 
hybridizing nucleic acid is at least 80%, e.g., at least 95%, at least 98% or 100%, 
identical to the sequence of a portion or all of a nucleic acid encoding a VKOR 
5 polypeptide. Hybridizing nucleic acids of the type described herein can be used, for 
example, as a cloning probe, a primer (e.g., a PCR primer), or a diagnostic probe. 
Also included within the invention are small inhibitory RNAs (siRNAs) and/or 
antisense RNAs that inhibit the function of VKOR, as determined, for example, in an 
activity assay, as described herein and as is known in the art. 
-10 - In another embodiment, the invention features cells, e.g., transformed cells, 

that contain a nucleic acid of this invention. A "transformed cell" is a cell into which 
(or into an ancestor of which) has been introduced, by means of recombinant nucleic 
acid techniques, a nucleic acid encoding all or a part of a VKOR polypeptide, and/or 
an antisense nucleic acid or siRNA. Both prokaryotic and eukaryotic cells are 

15 included, e.g., bacteria, yeast, insect, mouse, rat, human, plant and the like. 

The invention also features nucleic acid constructs (e.g., vectors and 
plasmids) that include a nucleic acid of the invention that is operably linked to a 
transcription and/or translation control elements to enable expression, e.g., 
expression vectors. By "operably linked" is meant that a selected nucleic acid, e.g., a 

20 DNA molecule encoding a VKOR polypeptide, is positioned adjacent to one or more 
regulatory elements, e.g., a promoter, which directs transcription and/or translation of 
the sequence such that the regulatory elements can control transcription and/or 
translation of the selected nucleic acid. 

The present invention further provides fragments or oligonucleotides of the 

25 nucleic acids of this invention, which can be used as primers or probes. Thus, in 
some embodiments, a fragment or oligonucleotide of this invention is a nucleotide 
sequence that is at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 
90, 100, 125, 150, 175, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 
800, 850, 900, 1000, 1500, 2000, 2500 or 3000 contiguous nucleotides of the 

30 nucleotide sequence set forth in SEQ ID NO:8 or SEQ ID NO:9. Examples of 
oligonucleotides of this invention are provided in the Sequence Listing included 
herewith. Such fragments or oligonucleotides can be detectably labeled or modified, 
for example, to include and/or incorporate a restriction enzyme cleavage site when 
employed as a primer in an amplification (e.g., PCR) assay. 
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The invention also features purified or isolated VKOR polypeptides, such as, 
for example, a polypeptide comprising, consisting essentially of and/or consisting of 
the amino acid sequence of SEQ ID NO:10 or a biologically active fragment or 
peptide thereof. Such fragments or peptides are typically at least about ten amino 
5 acids of the amino acid sequence of SEQ ID NO:10 (e.g., 15, 20, 25, 30, 35, 40, 45, 
50, 55, 60, 65, 75, 85, 95, 100, 125, or 150 amino acids of the amino acid sequence 
of SEQ ID NO:10) and can be peptides or fragment of contiguous amino acids of the 
amino acid sequence of the VKOR protein (e.g., as set forth in SEQ ID NO:10). The 
biological activity of a fragment or peptide of this invention can be determined 

10 according to the methods provided herein and as are known in the art for identifying 
VKOR activity. The fragments and peptides of the VKOR protein of this invention 
can also be active as antigens for the production of antibodies. The identification of 
epitopes on a fragment or peptide of this invention is carried out by well known 
protocols and would be within the ordinary skill of one in the art. 

15 As used herein, both "protein" and "polypeptide" mean any chain of amino 

acids, regardless of length or post-translational modification (e.g., glycosylation, 
phosphorylation or N-myristylation). Thus, the term "VKOR polypeptide" includes 
full-length, naturally occurring VKOR proteins, respectively, as well as recombinants 
or synthetically produced polypeptides that correspond to a full-length, naturally 

20 occurring VKOR protein, or to a portion of a naturally occurring or synthetic VKOR 
polypeptide. 

A "purified" or "isolated" compound or polypeptide is a composition that is at 
least 60% by weight the compound of interest, e.g., a VKOR polypeptide or antibody 
that is separated or substantially free from at least some of the other components of 

25 the naturally occurring organism or virus, for example, the cell or viral structural 
components or other polypeptides or nucleic acids commonly found associated with 
the polypeptide. As used herein, the "isolated 1 ' polypeptide is at least about 25%, 
50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99% or more pure (w/w). 
Preferably the preparation is at least 75% (e.g., at least 90% or 99%) by weight the 

30 compound of interest. Purity can be measured by any appropriate standard method, 
e.g., column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis. 

Preferred VKOR polypeptides include a sequence substantially identical to all 
or a portion of a naturally occurring VKOR polypeptide. Polypeptides "substantially 
identical" to the VKOR polypeptide sequences described herein have an amino acid 
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sequence that is at least 80% or 85% (e.g., 90%, 95% or 99%) identical to the amino 
acid sequence of the VKOR polypeptides of SEQ ID NO: 10. For purposes of 
comparison, the length of the reference VKOR polypeptide sequence will generally 
be at least 16 amino acids, e.g., at least 20, 25, 30, 35, 40, 45, 50, 75, or 100 amino 
5 acids. 

In the case of polypeptide sequences that are less than 100% identical to a 
reference sequence, the non-identical positions are preferably, but not necessarily, 
conservative substitutions for the reference sequence. Conservative substitutions 
typically include, but are not limited to, substitutions within the following groups: 

10 glycine and alanine; valine, isoleucine, and leucine; aspartic acid and glutamic acid; 
asparagine and glutamine; serine and threonine; lysine and arginine; and 
phenylalanine and tyrosine. 

Where a particular polypeptide is said to have a specific percent identity to a 
reference polypeptide of a defined length, the percent identity is relative to the 

15 reference polypeptide. Thus, for example, a polypeptide that is 50%, 75%, 85%, 
90%, 95% or 99% identical to a reference polypeptide that is 100 amino acids long 
can be a 50, 75, 85, 90, 95 or 99 amino acid polypeptide that is completely identical 
to a 50, 75, 85, 90, 95 or 99 amino acid long portion of the reference polypeptide. It 
can also be a 100 amino acid long polypeptide that is 50%, 75%, 85%, 90%, 95% or 

20 99% identical to the reference polypeptide over its entire length. Of course, other 
polypeptides also will meet the same criteria. 

The invention also features purified or isolated antibodies that specifically bind 
to a VKOR polypeptide of this invention or to a fragment thereof. By "specifically 
binds" is meant that an antibody recognizes and binds a particular antigen, e.g., a 

25 VKOR polypeptide, or an epitope on a fragment or peptide of a VKOR polypeptide, 
but does not substantially recognize and bind other molecules in a sample. In one 
embodiment the antibody is a monoclonal antibody and in other embodiments, the 
antibody is a polyclonal antibody. The production of both monoclonal and polyclonal 
antibodies, including chimeric antibodies, humanized antibodies, single chain 

30 antibodies, bi-specific antibodies, antibody fragments, etc., is well known in the art. 
In another aspect, the invention features a method for detecting a VKOR 
polypeptide in a sample. This method comprises contacting the sample with an 
antibody that specifically binds a VKOR polypeptide or a fragment thereof under 
conditions that allow the formation of a complex between an antibody and VKOR; 
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and detecting the formation of a complex, if any, as detection of a VKOR polypeptide 
or fragment thereof in the sample. Such immunoassays are well known in the art 
and include immunoprecipitation assays, immunoblotting assays, immunolabeling 
assays, ELISA, etc. 

5 The present invention further provides a method of detecting a nucleic acid 

encoding a VKOR polypeptide in a sample, comprising contacting the sample with a 
nucleic acid of this invention that encodes VKOR or a fragment thereof, or a 
complement of a nucleic acid that encodes VKOR or a fragment thereof, under 
conditions whereby a hybridization complex can form, and detecting formation of a 

10 hybridization complex, thereby detecting a nucleic acid encoding a VKOR 

polypeptide in a sample. Such hybridization assays are well known in the art and 
include probe detection assays and nucleic acid amplification assays. 

Also encompassed by the invention is a method of obtaining a gene related to 
(i.e., a functional homologue of) the VKOR gene. Such a method entails obtaining or 

15 producing a detectably-labeled probe comprising an isolated nucleic acid which 
encodes all or a portion of VKOR, or a homolog thereof; screening a nucleic acid 
fragment library with the labeled probe under conditions that allow hybridization of 
the probe to nucleic acid fragments in the library, thereby forming nucleic acid 
duplexes; isolating labeled duplexes, if any; and preparing a full-length gene 

20 sequence from the nucleic acid fragments in any labeled duplex to obtain a gene 
related to the VKOR gene. 

A further aspect of the present invention is a method of making a vitamin K 
dependent protein which comprises culturing a host cell that expresses a nucleic 
acid encoding a vitamin K dependent protein in the presence of vitamin K and 

25 produces a vitamin K dependent protein, and then harvesting the vitamin K 
dependent protein from the culture, the host cell containing and expressing a 
heterologous nucleic acid encoding vitamin K dependent carboxylase, and the host 
cell further containing and expressing a heterologous nucleic acid encoding vitamin 
K epoxide reductase (VKOR) and producing VKOR as described herein. The 

30 expression of the VKOR-encoding nucleic acid and the production of the VKOR 
causes the cell to produce greater levels of the vitamin K dependent protein than 
would be produced in the absence of the VKOR. 

Thus, in some embodiments, the present invention also provides a method of 
producing a vitamin K dependent protein, comprising: 
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a) introducing into a cell a nucleic acid that encodes a vitamin K dependent 
protein under conditions whereby the nucleic acid is expressed and the vitamin K 
dependent protein is produced in the presence of vitamin K, wherein the cell 
comprises a heterologous nucleic acid encoding vitamin K dependent carboxylase 

5 and further comprises a heterologous nucleic acid encoding vitamin K epoxide 
reductase; and 

b) optionally collecting the vitamin K dependent protein from the cell. The 
vitamin K dependent protein that can be produced can be any vitamin K dependent 
protein now known or later identified as such, including but not limited to Factor VII, 

10 Factor IX, Factor X, Protein C, Protein S and prothrombin, in any combination. Any 
host cell that can be transformed with the nucleic acids described can be used as 
described herein, although in some embodiments non-human or even non- 
mammalian host cells can be used. Nucleic acids encoding vitamin K dependent 
carboxylase and nucleic acids encoding vitamin K dependent proteins as described 

1 5 herein are well known in the art and their introduction into cells for expression would 
be carried out according to routine protocols. 

Certain embodiments of this invention are based on the inventors' discovery 
that a subjects therapeutic dose of warfarin for anticoagulation therapy can be 
correlated with the presence of one or more single nucleotide polymorphisms in the 

20 VKOR gene of the subject. Thus, the present invention also provides a method of 
identifying a human subject having increased or decreased sensitivity to warfarin, 
comprising detecting in the subject the presence of a single nucleotide polymorphism 
(SNP) in the VKOR gene, wherein the single nucleotide polymorphism is correlated 
with increased or decreased sensitivity to warfarin, thereby identifying the subject as 

25 having increased or decreased sensitivity to warfarin. 

An example of a SNP correlated with an increased sensitivity to warfarin is a 
G->C alteration at nucleotide 2581 (SEQ ID NO:12) (in intron 2 of the VKOR gene; 
GenBank accession no. refSNP ID: rs8050894, incorporated by reference herein) of 
the nucleotide sequence of SEQ ID NO:11, which is a reference sequence 

30 encompassing the genomic sequence of SEQ ID NO:8 and approximately 1000 
nucleotides preceding and following this sequence. This sequence can be located 
as having the genome position "human chromosome 16p1 1 .2" or in the physical map 
in the NCBI database as human chromosome 16: 31009700-31013800. 
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Examples of SNPs correlated with a decreased sensitivity to warfarin are a 
T-»C alteration at nucleotide 3294 (SEQ ID NO: 13) (in intron 2 of the VKOR gene; 
GenBank accession no. refSNP ID: rs2359612, incorporated by reference herein) of 
the nucleotide sequence of SEQ ID NO:11 and a G->A alteration at nucleotide 4769 
5 (SEQ ID NO:14) (in the 3' UTR of the VKOR gene; GenBank accession no. refSNP 
ID: rs7294, incorporated by reference herein) of the nucleotide sequence of SEQ ID 
NO:11. 

As used herein, a subject having an "increased sensitivity to warfarin" is a 
subject for whom a suitable therapeutic or maintenance dose of warfarin is lower 

10 than the therapeutic or maintenance dose of warfarin that would suitable for a 
normal subject, i.e., a subject who did not carry a SNP in the VKOR gene that 
imparts a phenotype of increased sensitivity to warfarin. Conversely, as used herein, 
a subject having a "decreased sensitivity to warfarin" is a subject for whom a suitable 
therapeutic or maintenance dose of warfarin is higher than the therapeutic or 

15 maintenance dose of warfarin that would suitable for a normal subject, i.e., a subject 
who did not carry a SNP in the VKOR gene that imparts a phenotype of decreased 
sensitivity to warfarin. An example of a typical therapeutic dose of warfarin for a 
normal subject is 35 mg per week, although this amount can vary (e.g., a dose range 
of 3.5 to 420 mg per week is described in Aithal et al. (1999) Lancet 353:717-719). 

20 A typical therapeutic dose of warfarin can be determined for a given study group 
according to the methods described herein, which can be used to identify subjects 
with therapeutic warfarin doses above or below this dose, thereby identifying 
subjects having decreased or increased sensitivity to warfarin. 

Further provided herein is a method of identifying a human subject having 

25 increased or decreased sensitivity to warfarin, comprising: a) correlating the 
presence of a single nucleotide polymorphism in the VKOR gene with increased or 
decreased sensitivity to warfarin; and b) detecting the single nucleotide 
polymorphism of step (a) in the subject, thereby identifying a subject having 
increased or decreased sensitivity to warfarin. 

30 In addition, the present invention provides a method of identifying a single 

nucleotide polymorphism in the VKOR gene correlated with increased or decreased 
sensitivity to warfarin, comprising: a) identifying a subject having increased or 
decreased sensitivity to warfarin; b) detecting in the subject the presence of a single 
nucleotide polymorphism in the VKOR gene; and c) correlating the presence of the 
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single nucleotide polymorphism of step (b) with the increased or decreased 
sensitivity to warfarin in the subject, thereby identifying a single nucleotide 
polymorphism in the VKOR gene correlated with increased or decreased sensitivity 
to warfarin. 

5 Also provided herein is a method of correlating a single nucleotide 

polymorphism in the VKOR gene of a subject with increased or decreased sensitivity 
to warfarin, comprising: a) identifying a subject having increased or decreased 
sensitivity to warfarin; b) determining the nucleotide sequence of the VKOR gene of 
the subject of (a); c) comparing the nucleotide sequence of step (b) with the wild type 
10 nucleotide sequence of the VKOR gene; d) detecting a single nucleotide 

polymorphism in the nucleotide sequence of (b); and e) correlating the single 
nucleotide polymorphism of (d) with increased or decreased sensitivity to warfarin in 
the subject of (a). 

A subject is identified as having an increased or decreased sensitivity to 

1 5 warfarin by establishing a therapeutic or maintenance dose of warfarin for 
anticoagulation therapy according to well known protocols and comparing the 
therapeutic or maintenance dose for that subject with the therapeutic or maintenance 
dose of warfarin for anticoagulation therapy of a population of normal subjects (e.g., 
subjects lacking any SNPs in the VKOR gene correlated with increased or 

20 decreased sensitivity to warfarin) from which an average or mean therapeutic or 
maintenance dose of warfarin is calculated. A subject having a therapeutic or 
maintenance dose of warfarin that is below the average therapeutic or maintenance 
dose of warfarin (e.g., the dose of warfarin that is therapeutic or provides a 
maintenance level for a subject that has a wild type VKOR gene, i.e., lacking any 

25 single nucleotide polymorphisms associated with warfarin sensitivity) is a subject 
identified as having an increased sensitivity to warfarin. A subject having a 
therapeutic or maintenance dose of warfarin that is above the average therapeutic or 
maintenance of warfarin is a subject identified as having a decreased sensitivity to 
warfarin. An average therapeutic or maintenance dose of warfarin for a subject with 

30 a wild type VKOR gene would be readily determined by one skilled in the art. 

The nucleotide sequence of the VKOR gene of a subject is determined 
according to methods standard in the art, and as described in the Examples provided 
herein. For example, genomic DNA is extracted from cells of a subject and the 
VKOR gene is located and sequenced according to known protocols. Single 
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nucleotide polymorphisms in the VKOR gene are identified by a comparison of a 
subject's sequence with the wild type sequence as known in the art (e.g., the 
reference sequence as shown herein as SEQ ID NO: 11). 

A SNP in the VKOR gene is correlated with an increased or decreased sensitivity 

5 to warfarin by identifying the presence of a SNP or multiple SNPs in the VKOR gene 
of a subject also identified as having increased or decreased sensitivity to warfarin, 
i.e., having a maintenance or therapeutic dose of warfarin that is above or below the 
average dose and performing a statistical analysis of the association of the SNP or 
SNPs with the increased or decreased sensitivity to warfarin, according to well 

„10 known methods of statistical analysis. An analysis that identifies a statistical 

association (e.g., a significant association) between the SNP(s) (genotype) and 
increased or decreased warfarin sensitivity (phenotype) establishes a correlation 
between the presence of the SNP(s) in a subject and an increased or decreased 
sensitivity to warfarin in that subject. 

15 it is contemplated that a combination of factors, including the presence of one 

or more SNPs in the VKOR gene of a subject, can be correlated with an increased or 
decreased sensitivity to warfarin in that subject. Such factors can include, but are 
not limited to cytochrome p450 2C9 polymorphisms, race, age, gender, smoking 
history and hepatic disease. 

20 Thus, in a further embodiment, the present invention provides a method of 

identifying a human subject having increased or decreased sensitivity to warfarin, 
comprising identifying in the subject the presence of a combination of factors 
correlated with an increased or decreased sensitivity to warfarin selected from the 
group consisting of one or more single nucleotide polymorphisms of the VKOR gene, 

25 one or more cytochrome p450 2C9 polymorphisms, race, age, gender, smoking 
history, hepatic disease and any combination of two or more of these factors, 
wherein the combination of factors is correlated with increased or decreased 
sensitivity to warfarin, thereby identifying the subject having increased or decreased 
sensitivity to warfarin. 

30 Further provided herein is a method of identifying a human subject having 

increased or decreased sensitivity to warfarin, comprising: a) correlating the 
presence of a combination of factors with an increased or decreased sensitivity to 
warfarin, wherein the factors are selected from the group consisting of one or more 
single nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 

17 



WO 2005/030039 



PCT/US2004/031481 



2C9 polymorphisms, race, age, gender, smoking history, hepatic disease and any 
combination of two or more of these factors; and b) detecting the combination of 
factors of step (a) in the subject, thereby identifying a subject having increased or 
decreased sensitivity to warfarin. 
5 In addition, the present invention provides a method of identifying a 

combination of factors correlated with an increased or decreased sensitivity to 
warfarin, wherein the factors are selected from the group consisting of one or more 
single nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 
2C9 polymorphisms, race, age, gender, smoking history, hepatic disease and any 

10 combination of two or more of these factors, comprising: a) identifying a subject 
having increased or decreased sensitivity to warfarin; b) detecting in the subject the 
presence of a combination of the factors; and c) correlating the presence of the 
combination of factors of step (b) with the increased or decreased sensitivity to 
warfarin in the subject, thereby identifying a combination of factors correlated with 

1 5 increased or decreased sensitivity to warfarin. 

Also provided herein is a method of correlating a combination of factors, 
wherein the factors are selected from the group consisting of one or more single 
nucleotide polymorphisms of the VKOR gene, one or more cytochrome p450 2C9 
polymorphisms, race, age, gender, smoking history, hepatic disease and any 

20 combination of two or more of these factors, with increased or decreased sensitivity 
to warfarin, comprising: a) identifying a subject having increased or decreased 
sensitivity to warfarin; b) identifying the presence of a combination of the factors in 
the subject; and c) correlating the combination of the factors of (b) with increased or 
decreased sensitivity to warfarin in the subject of (a). 

25 A combination of factors as described herein is correlated with an increased 

or decreased sensitivity to warfarin by identifying the presence of the combination of 
factors in a subject also identified as having increased or decreased sensitivity to 
warfarin and performing a statistical analysis of the association of the combination of 
factors with the increased or decreased sensitivity to warfarin, according to well 

30 known methods of statistical analysis. An analysis that identifies a statistical 

association (e.g., a significant association) between the combination of factors and 
the warfarin sensitivity phenotype (increased or decreased) establishes a correlation 
between the presence of the combination of factors in a subject and an increased or 
decreased sensitivity to warfarin in that subject. 
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Further provided herein are nucleic acids encoding VKOR and comprising one 
or more SNPs as described herein. Thus, the present invention further provides 
nucleic acids comprising, consisting essentially of and/or consisting of the nucleotide 
sequence as set forth in SEQ ID NOs:12, 13, 14, 15 and 16. The nucleic acids can 
5 be present in a vector and the vector can be present in a cell. Further included are 
proteins encoded by a nucleic acid comprising a nucleotide sequence as set forth in 
SEQ ID IMOs:12, 13, 14, 15 and 16, as well as antibodies that specifically bind a 
protein encoded by a nucleic acid comprising a nucleotide sequence as set forth in 
SEQ ID NOs:12, 13, 14, 15 and 16. The present invention is more particularly 
10 described in the following examples that are intended as illustrative only since 

numerous modifications and variations therein will be apparent to those skilled in the 
art. 



EXAMPLES 

15 

EXAMPLE I CORRELATION BETWEEN SNPS IN VKOR GENE AND 
INCREASED OR DECREASED SENSITIVITY TO WARFARIN 

The most prevalent isoform of the VKOR gene is about 4 kb long, has three 
exons and encodes an enzyme of 163 amino acids with a mass of 18.4 kDa. In the 

20 present study, three mutations vk2581 (G>C), vk3294(T>C) and vk4769(G>A), 

identified as SNPs (heterozygosity ratios of 46.9%, 46.8% and 46.3%, respectively) 
were examined for a correlation between their presence in a subject and the 
maintenance dose of warfarin required to achieve a therapeutically effective 
response. 

25 1 . Selection of subjects 

Subjects were obtained from the UNC Coagulation Clinic in the Ambulatory 
Care Center. Informed consent was obtained by a trained genetic counselor. 
Subjects not fluent in English were excluded because of the lack of translators and 
the requirement for consent. To qualify for the study, subjects had warfarin for at 

30 least six months, were older than 1 8 and were followed by the UNC Coagulation 
clinic at the Ambulatory Care Clinic. 
2. Extraction of genomic DNA from whole blood 
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Genomic DNAs were extracted from the whole blood of subjects using 
QIAamp DNA Blood Mini Kit (QIAGEN cat#51 104). The DNA concentration was 
adjusted to 10 ng/pL 

3. Sequencing of the genomic DNA samples 
5 Approximately 1 0 ng of DNA was used for polymerase chain reaction (PCR) 

assays. The primers used to amplify the VKOR gene were: Exon 1-5' 
CCAATCGCCGAGTCAGAGG (SEQ ID NO:29) and Exon 1-3* 
CCCAGTCCCCAGCACTGTCT (SEQ ID NO:30) for the 5-UTR and Exon 1 region; 
Exon 2-5 1 AGGGGAGGATAGGGTCAGTG (SEQ ID NO:31) and Exon 2-3' 

10 CGTGTTAGTTACCTCCCCACA (SEQ ID NO:32) for the Exon 2 region; and Exon 3- 
5' ATACGTGCGTAAGCCACCAC (SEQ ID NO:33) and Exon 3-3' 
ACCCAGATATGCCCCCTTAG (SEQ ID NO:34)forthe Exon3 and 3'-UTR region. 
Automated high throughput capillary electrophoresis DNA sequencing was used for 
detecting SNPs in the VKOR gene. 

15 4. Detection of known SNPs using real-time PCR 

The assay reagents for SNP genotyping were from the Assay-by-Design™ 
service (Applied Biosystems, cat#4332072). The primers and probes (FAM™ and 
VIC™ dye-labeled) were designed using Primer Express software and were 
synthesized in an Applied Biosystems synthesizer. The primer pairs for each SNP 

20 are located at the upstream/downstream position of the SNP site and can generate 
less than 100 bp length of a DNA fragment in the PCR reaction. The FAM™ and 
VIC™ dye-labeled probes were designed to cover the SNP sites with a length of 15- 
16 nt. The primer and probe sequences for each VKOR SNP are shown in Table 2. 
The 2X TaqMan™ Universal PCR Master Mix, No AmpErase UNG (Applied 

25 Biosystems, cat#4324018) was used in the PCR reactions. Forty cycles of real-time 
PCR were performed in an Opticon II (MJ Research) machine. There was a 10 
minute 95°C preheat followed by 92°C for 15 sec, 60°C for 1 min. and then a plate 
reading. The results were read according to the signal value of FAM and VIC dye. 
5. Statistical analysis 

30 The difference of average dose between different genotypes was compared 

by analysis of variance (ANOVA) using SAS version 8.0 (SAS, Inc., Cary, NC). A 
two-sided p value less than 0.05 was considered significant. Examination of the 
distribution and residuals for the average dose of treatment among the SNP groups 
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indicated that a log transformation was necessary to satisfy the assumption of 
homogeneity of variance. 

6. Correlation of SNPs with warfarin dosage 

By direct genomic DNA sequencing and SNP real-time PCR detection, five 
5 SNPs were identified in the VKOR gene: one in the 5-UTR, two in intron II, one in 
the coding region and one in the 3'-UTR (Table 1). 

Among these SNPs, the vk563 and vk4501 SNPs allele were carried by only 
one of the 58 subjects of the study (a triple heterozygous, also carrying the 3-UTR 
SNP allele), while the other SNPs were identified in 17-25 heterozygous patients. 

10 Each marker was first analyzed independently. Figure 1 A shows that the 

average warfarin dose for patients with the vk2581 wild type allele was 50.19±3.20 
mg per week (n=26), while those heterozygous and homozygous for this 
polymorphism were 35.19±3.73 (n=17) and 31.14±6.2 mg per week (n=15), 
respectively. Figure 1 B shows that the average warfarin dose for patients with the 

1 5 wild-type vk3294 allele was 25.29±3.05 mg per week (n=1 1 ), while patients bearing 
the heterozygous and homozygous alleles were 41.68±4.92 (n=25) and 47.73±2.75 
mg per week (n=22), respectively. Figure 1C shows the average warfarin dose for 
patients with vk4769 SNP wild type was 35.35±4.01 mg per week (n=27), while 
patients with the heterozygous and homozygous alleles required 44.48±4.80 (n=19) 

20 and 47.56±3.86 mg per week (n=12), respectively. It was also observed that P450 
2C9 *3 has a significant effect on warfarin dose (Figure 1D), as previously reported 
(Joffe et al. (2004) 'Warfarin dosing and cytochrome P450 2C9 polymorphisms" 
Thromb Haemost 91 :1 123-1 128). The average warfarin dose for patients with P450 
2C9 *1 (wild type) was 43.82±2.75 mg per week (n=50), while patients heterozygous 

25 for this allele required 22.4±4.34 mg per week (n=8). 

7. Statistical analysis 

The association of the Log e (warfarin average dosage)(LnDose) with the 
SNPs in the VKOR gene was examined by analysis of variance (ANOVA). SAS was 
used first to do a repeated procedure in which a series of factors (race, gender, 
30 smoking history, hepatic diseases, SNPs at cytochrome P450 2Y9 gene, etc.) were 
examined to identify factors, excluding VKOR SNPs, which might affect dosage. 
P450 2C9 *3 was significantly associated with the average dose of warfarin; thus, it 
was included as a covariant for further analysis. The analysis indicated that the- 
three VKOR SNPs were still significantly associated with weekly warfarin dose 
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(vk2581 , P < 0.0001 ; vk3294, P <0.0001 ; and vk4769 t P = 0.0044), when the 
covariance is included. 

To specifically test if the three SNPs of VKOR were independently associated 
with warfarin dosage, the analysis was repeated in which two SNPs in the VKOR 
5 gene were included as covariates for the other SNP. The three VKOR SNPs are 
located within 2 kb distance of one another and are expected to be closely linked. It 
was clear from inspection that, at least for Caucasians, one haplotype (where 
A=vk2581 guanine and a=vk2581 cytosine; B=vk3294 thymine and b=vk3924 
cytosine; C=vk4769 guanine and c=vk4769 adenine) was AAbbcc and another 
10 aaBBCC. The distribution of individual SNPs in patients was found to be significantly 
correlated with the others (R=0.63-0.87, p<0.001). Indeed, subjects with the 
haplotype AAbbcc (n=7) required a significantly higher dosage of warfarin (warfarin 
dosage=48.98±3.93) compared to those patients with haplotype aaBBCC 
(25.29±3.05; p<0.001). 

15 

EXAMPLE 2 siRNA DESIGN AND SYNTHESIS 

siRNAs were selected using an advanced version of a rational design 
algorithm (Reynolds et al. (2004) "Rational siRNA design for RNA interference" 
Nature Biotechnology 22:326-330). For each of the 13 genes, four siRNAs duplexes 

20 with the highest scores were selected and a BLAST search was conducted using the 
Human EST database. To minimize the potential for off-target silencing effects, only 
those sequence targets with more than three mismatches against un-related 
sequences were selected (Jackson et al. (2003) "Expression profiling reveals off- 
target gene regulation by RNAi" Nat Biotechnol 21:635-7). All duplexes were 

25 synthesized in Dharmacon (Lafayette, CO) as 21-mers with UU overhangs using a 
modified method of 2'-ACE chemistry (Scaringe (2000) "Advanced 5'-silyl-2'- 
orthoester approach to RNA oligonucleotide synthesis" Methods Enzymol 317:3-18) 
and the AS strand was chemically phosphorylated to ensure maximum activity 
(Martinez et al. (2002) "Single-stranded antisense siRNAs guide target RNA 

30 cleavage in RNAi" Cell 1 1 0:563-74). 



22 



WO 2005/030039 



PCT/US2004/031481 



EXAMPLE 3 siRNA transfection 

Transfection was essentially as previously described (Harborth et al. (2001) 
"Identification of essential genes in cultured mammalian cells using small interfering 
RNAs" J Cell Sci 1 14:4557-65) with minor modifications. 

5 

EXAMPLE 4 VKOR activity assay 

siRNA transfected A549 cells were trypsinized and washed twice with cold 
PBS. 1.5x10 7 cells were taken for each VKOR assay. 200 pL buffer D (250 mM 
Na 2 HP04-NaH 2 P04, 500 mM KCI, 20% glycerol and 0.75% CHAPS, pH 7.4) was 

1 0— added to the cell pellet, followed by sonication of the cell lysate. For assays of 
solubilized microsomes, microsomes were prepared from 2x1 0 9 cells as described 
(Lin et al. (2002) 'The putative vitamin K-dependent gamma-glutamyl carboxylase 
internal propeptide appears to be the propeptide binding site" J Biol Chem 
277:28584-91); 10 to 50 pL of solubilized microsomes were used for each assay. 

15 Vitamin K epoxide was added to the concentration indicated in the figure legends 
and DTT was added to 4 mM to initiate the reaction. The reaction mixture was 
incubated in yellow light at 30°C for 30 minutes and stopped by adding 500 pL 0.05 
M AgN0 3 : isopropanol (5:9). 500 pL hexane was added and the mixture was 
vortexed vigorously for 1 minute to extract the vitamin K and KO. After 5 minutes 

20 centrifugation, the upper organic layer was transferred to a 5-mL brown vial and 
dried with N 2 . 150 pL HPLC buffer, acetonitrile:isopropanol:water (100:7:2), was 
added to dissolve the vitamin K and KO and the sample was analyzed by HPLC on 
an A C-18 column (Vydac, cat#218TP54). 

25 EXAMPLE 5 RT-qPCR (reverse transcriptase quantitative PCR) 

1x10 6 cells were washed with PBS twice and total RNA was isolated with 
Trizol reagent according to the manufacturer's protocol (Invitrogen). 1 pg of RNA 
was digested by RQ1 DNasel (Promega) and heat-inactivated. First strand cDNA 
was made with M-MLV reverse transcriptase (Invitrogen). cDNAs were mixed with 
30 DyNAmo SYBR Green qPCR pre-mix (Finnzymes) and real-time PCR was 
performed with an Opticon II PCR thermal cycler (MJ Research). The following 
primers were used: 

13124769-5' (F): (TCCAACAGCATATTCGGTTGC, SEQ ID NO: 1); 
13124769-3 (R)': (TTCTTGGACCTTCCGGAAACT, SEQ ID NO: 2); 
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GAPDH-F: (GAAGGTGAAGGTCGGAGTC, SEQ ID NO: 3); 
GAPDH-R: (GAAGATGGTGATGGGATTTC, SEQ ID NO: 4); 
Lamin-RT-F: (CTAGGTGAGGCCAAGAAGCAA, SEQ ID NO: 5) and 
Lamin-RT-R: (CTGTTCCTCTCAGCAGACTGC, SEQ ID NO: 6). 

5 

EXAMPLE 6 Over-expression of VKOR in Sf9 insect cell line 

The cDNA for the mGC1 1276 coding region was cloned into pVL1392 
(Pharmingen), with the HPC4 tag (EDQVDPRLIDGK, SEQ ID NO: 7) at its amino 
terminus and expressed in Sf9 cells as described (Li et al. (2000) "Identification of a 
10 Drosophila vitamin K-dependent gamma-glutamyl carboxylase" J Biol Chem 
275:18291-6). 

EXAMPLE 7 Gene selection 

The search for the VKOR gene was focused on human chromosome sixteen 

15 between markers D16S3131 and D16S419. This region corresponds to chromosome 
16 at 50cM-65cM on the genetic map and 26-46. 3Mb on the physical map. 190 
predicted coding sequences in this region were analyzed by a BLASTX search of the 
NCBI non-redundant protein database. Those human genes and orthologs from 
related species with known function were eliminated. Because VKOR appears to be 

20 a transmembrane protein (Carlisle & Suttie (1980) 'Vitamin K dependent 

carboxylase: subcellular location of the carboxylase and enzymes involved in vitamin 
K metabolism in rat liver" Biochemistry 19:1 161-7), the remaining genes were 
translated according to the cDNA sequences in the NCBI database and analyzed 
with the programs TMHMM and TMAP (Biology WorkBench, San Diego 

25 Supercomputer System) to predict those with transmembrane domains. Thirteen 
genes predicted to code for integral membrane proteins were chosen for further 
analysis. 

EXAMPLE 8 Cell line screening for VKOR activity 
30 The strategy was to identify a cell line expressing relatively high amounts of 

VKOR activity and use siRNA to systematically knock down all thirteen candidate 
genes. siRNA, double stranded RNA of 21-23 nucleotides, has been shown to cause 
specific RNA degradation in cell culture (Hara et al. (2002) "Raptor, a binding partner 
of target of rapamycin (TOR), mediates TOR action" Cell 110:177-89; Krichevsky & 

24 
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Kosik (2002) "RNAi functions in cultured mammalian neurons" Proc Natl Acad Sci 
USA 99:1 1926-9; Burns et al. (2003) "Silencing of the Novel p53 Target Gene 
Snk/Plk2 Leads to Mitotic Catastrophe in Paclitaxel (Taxol)-Exposed Cells" Mol Cell 
8/0/23:5556-71). However, application of siRNA for large scale screening in 
5 mammalian cells has not previously been reported because of the difficulty in 
identifying a functional target for a specific mammalian cell mRNA (Holen et al. 
(2003) "Similar behaviour of single-strand and double-strand siRNAs suggests they 
act through a common RNAi pathway" Nucleic Acids Res 31:2401-7). The 
development of a rational selection algorithm (Reynolds et al.) for siRNA design 

10 increases the probability that a specific siRNA can be developed; furthermore, the 
probability of success can be increased by pooling four rationally selected siRNAs. 
Using siRNA to search for previously unidentified genes has the advantage that, 
even if VKOR activity requires the product of more than one gene for activity, the 
screen should still be effective because the assay determines the loss of enzymatic 

15 activity. 

Fifteen cell lines were screened and a human lung carcinoma line, A549, was 
identified to exhibit sufficient warfarin-sensitive VKOR activity for facile 
measurement. A second human colorectal adenocarcinoma cell line, HT29, which 
expressed very little VKOR activity, was used as a reference. 

20 

EXAMPLE 9 siRNA inhibition of VKOR activity in A549 cells 

Each of the thirteen pools of siRNA were transfected in triplicate into A549 
cells and assayed for VKOR activity after 72 hours. One siRNA pool specific for 
gene gi:1 3124769 reduced VKOR activity by 64%-70% in eight separate assays 
25 (Fig. 2). 

One possible reason that VKOR activity was inhibited to only -35% of its 
initial activity after 72 hours is that the half-life of mammalian proteins varies greatly 
(from minutes to days) (Zhang et al. (1996) 'The major calpain isozymes are long- 
lived proteins. Design of an antisense strategy for calpain depletion in cultured cells" 
30 J Biol Chem 271 :1 8825-30; Bohley (1 996) "Surface hydrophobicity and intracellular 
degradation of proteins" Biol Chem 377:425-35; Dice & Goldberg (1975) 
"Relationship between in vivo degradative rates and isoelectric points of proteins" 
Proc Natl Acad Sci USA 72:3893-7), and mRNA translation is being inhibited, not 
enzyme activity. Therefore, the cells were carried through eleven days and their 

25 
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VKOR activity followed. Figure 3 shows that the level of mRNA for gi:13124769 
mRNA decreased rapidly to about 20% of normal while VKOR activity decreased 
continuously during this time period. This reduction in activity is not a general effect 
of the siRNA or the result of cell death because the level of VKD carboxylase activity 
5 and lamin A/C mRNA remained constant. Furthermore, the level of gi: 1321 24769 
mRNA is four fold lower in HT-29 cells, which have low VKOR activity, than in A549 
cells that exhibit high VKOR activity. These data indicate that gi:13124769 
corresponds to the VKOR gene. 



10 EXAMPLE 10 Identification of gene encoding VKOR 

The gene, IMAGE 3455200 (gi:13124769, SEQ ID NO: 8), identified herein to 
encode VKOR, maps to human chromosome 16p11.2, mouse chromosome 7F3, 
and rat chromosome 1:180.8 Mb. There are 338 cDNA clones in the NCBI database 
representing seven different splicing patterns (NCBI AceView program). These are 

15 composed of all or part of two to four exons. Among these, the most prevalent 
isoform, mGC1 1276, has three exons and is expressed at high levels in lung and 
liver cells. This three exon transcript (SEQ ID NO: 9) encodes a predicted protein of 
163 amino acids with a mass of 18.2 kDa (SEQ ID NO: 10). It is a putative N- 
myristylated endoplasmic reticulum protein with one to three transmembrane 

20 domains, depending upon the program used for prediction. It has seven cysteine 
residues, which is consistent with observations that the enzymatic activity is 
dependent upon thiol reagents (Thijssen et al. (1994) "Microsomal lipoamide 
reductase provides vitamin K epoxide reductase with reducing equivalents" Biochem 
J 297:277-80). Five of the seven cysteines are conserved among human, mice, rat, 

25 zebrafish, Xenopus and Anopheles. 

To confirm that the VKOR gene had been identified, the most prevalent form 
of the enzyme (three exons) was expressed in Spodoptera frugiperda, Sf9 cells. Sf9 
cells have no measurable VKOR activity but exhibit warfarin sensitive activity when 
transfected with mGC1 1276 cDNA (Figure 4). VKOR activity is observed from 

30 constructs with an epitope tag at either their amino or carboxyl terminus. This tag 
should assist in the purification of VKOR. 

VKOR should exhibit warfarin sensitivity, therefore microsomes were made 
from Sf9 cells expressing VKOR and tested for warfarin sensitivity. The VKOR 
activity is warfarin-sensitive (Figure 5). 

26 
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In summary, the present invention provides the first example of using siRNA 
in mammalian cells to identify an unknown gene. The identity of the VKOR gene was 
confirmed by its expression in insect cells. The VKOR gene encodes several 
isoforms. It will be important to characterize the activity and expression pattern of 
each isoform. Millions of people world-wide utilize warfarin to inhibit coagulation; 
therefore it is important to further characterize VKOR as it can lead to more accurate 
dosing or design of safer, more effective, anti-coagulants. 

The foregoing is illustrative of the present invention, and is not to be 
construed as limiting thereof. The invention is defined by the following claims, with 
equivalents of the claims to be included therein. 

All publications, patent applications, patents, patent publications and 
other references cited herein are incorporated by reference in their entireties for the 
teachings relevant to the sentence and/or paragraph in which the reference is 
presented. 
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Table 1. Five SNPs examined in VKOR gene 



SNPs 


posit 
ion 


AA change 


Heterozygous ratio 


vk563 
G>A 
(SEQ ID 
NO:15) 


5'- 
UTR 


N/A 


1/58 


vk2581 G>C 
(SEQ ID 
NO:12) 


Intro 
n2 


N/A 


17/58 


vk3294 T>C 
(SEQ ID 
NO:13) 


Intro 
n2 


N/A 


25/58 


vk4501 OT 
(SEQ ID 
NO:16 


Exo 
n3 


Leu120Leu 


1/58 


vk4769 G>A 
(SEQ ID 
NO:14 


3'- 
UTR 


N/A 


19/58 
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THAT WHICH IS CLAIMED IS: 

1. A method of identifying a human subject having increased sensitivity to 
warfarin, comprising detecting in the subject the presence of a single nucleotide 
polymorphism in the VKOR gene, wherein the single nucleotide polymorphism is 
correlated with increased sensitivity to warfarin, thereby identifying the subject 
having increased sensitivity to warfarin. 

2. The method of claim 1, wherein the single nucleotide polymorphism in the 
VKOR gene is a G->C alteration at nucleotide 2581 of the nucleotide sequence of 
SEQ ID NO:11. 

3. A method of identifying a human subject having increased sensitivity to 
warfarin, comprising: 

a) correlating the presence of a single nucleotide polymorphism in the VKOR 
gene with increased sensitivity to warfarin; and 

b) detecting the single nucleotide polymorphism of step (a) in the subject, 
thereby identifying a subject having increased sensitivity to warfarin. 

4. A method of identifying a single nucleotide polymorphism in the VKOR 
gene correlated with increased sensitivity to warfarin, comprising: 

a) identifying a subject having increased sensitivity to warfarin; 

b) detecting in the subject the presence of a single nucleotide polymorphism 
in the VKOR gene; and 

c) correlating the presence of the single nucleotide polymorphism of step (b) 
with the increased sensitivity to warfarin in the subject, thereby identifying a single 
nucleotide polymorphism in the VKOR gene correlated with increased sensitivity to 
warfarin. 

5. A method of correlating a single nucleotide polymorphism in the VKOR 
gene of a subject with increased sensitivity to warfarin, comprising: 

a) identifying a subject having increased sensitivity to warfarin; 
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b) determining the nucleotide sequence of the VKOR gene of the subject of 

c) comparing the nucleotide sequence of step (b) with the wild type nucleotide 
sequence of the VKOR gene; 

d) detecting a single nucleotide polymorphism in the nucleotide sequence of 
(b); and 

e) correlating the single nucleotide polymorphism of (d) with increased 
sensitivity to warfarin in the subject of (a). 

6. A method of identifying a human subject having decreased sensitivity to 
warfarin, comprising detecting in the subject the presence of a single nucleotide 
polymorphism in the VKOR gene, wherein the single nucleotide polymorphism is 
correlated with decreased sensitivity to warfarin, thereby identifying the subject 
having decreased sensitivity to warfarin. 

7. The method of claim 6, wherein the single nucleotide polymorphism in the 
VKOR gene is a T->C alteration at nucleotide 3294 of the nucleotide sequence of 
SEQ ID NO:11. 

8. The method of claim 6, wherein the single nucleotide polymorphism in the 
VKOR gene is a G->A alteration at nucleotide 4769 of the nucleotide sequence of 
SEQIDNO:11. 

9. A method of identifying a human subject having decreased sensitivity to 
warfarin, comprising: 

a) correlating the presence of a single nucleotide polymorphism in the VKOR 
gene with decreased sensitivity to warfarin; and 

b) detecting the single nucleotide polymorphism of step (a) in the subject, 
thereby identifying a subject having decreased sensitivity to warfarin. 

10. A method of identifying a single nucleotide polymorphism in the VKOR 
gene correlated with decreased sensitivity to warfarin, comprising: 

a) identifying a subject having decreased sensitivity to warfarin; 
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b) detecting in the subject the presence of a single nucleotide polymorphism 
in the VKOR gene; and 

c) correlating the presence of the single nucleotide polymorphism of step (b) 
with the decreased sensitivity to warfarin in the subject, thereby identifying a single 
nucleotide polymorphism in the VKOR gene correlated with decreased sensitivity to 
warfarin. 

11. A method of correlating a single nucleotide polymorphism in the VKOR 
gene of a subject with decreased sensitivity to warfarin, comprising: 

a) identifying a subject having decreased sensitivity to warfarin; 

b) determining the nucleotide sequence of the VKOR gene of the subject of 

(a) ; 

c) comparing the nucleotide sequence of step (b) with the wild type nucleotide 
sequence of the VKOR gene; 

d) detecting a single nucleotide polymorphism in the nucleotide sequence of 

(b) ; and 

e) correlating the single nucleotide polymorphism of (d) with decreased 
sensitivity to warfarin in the subject of (a). 

12. In a method of making a vitamin K dependent protein which comprises 

a) culturing a host cell which expresses a nucleic acid encoding a vitamin K 
dependent protein in the presence of vitamin K and produces a vitamin K dependent 
protein, and 

b) harvesting said vitamin K dependent protein from the culture, 

said host cell containing and expressing a heterologous nucleic acid encoding 
vitamin K dependent carboxylase, the improvement comprising: 

employing as said host cell a host cell further containing and expressing a 
heterologous nucleic acid encoding vitamin K epoxide reductase (VKOR). 

13. The method of claim 12, wherein said vitamin K dependent protein is 
selected from the group consisting of Factor VII, Factor IX, Factor X, Protein C, 
Protein S, and prothrombin. 

14. The method of claim 12, wherein said host cell is a plant cell. 



32 



WO 2005/030039 PCTAJS2004/031481 

15. The method of claim 12, wherein said host cell is an insect cell. 

16. The method of claim 12, wherein said vitamin K-dependent carboxylase is 
bovine vitamin K dependent carboxylase. 
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SEQUENCE LISTING 

<110> University of North Carolina -Chapel Hill 
Stafford, Darrel 
Li, Tao 

<120> IDENTIFICATION OF THE GENE FOR VITAMIN K EPOXIDE REDUCTASE 
<130> 5470.401WO 

<150> US 60/505,527 
<151> 2003-09-23 

<160> 34 

<170> Patentln version 3.2 

<210> 1 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 

<400> 1 

tccaacagca tattcggttg c 21 



<210> 2 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 

<400> 2 

ttcttggacc ttccggaaac t 21 



<210> 3 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 

<400> 3 

gaaggtgaag gtcggagtc 19 



<210> 4 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 
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<400> 4 

gaagatggtg atgggatttc 20 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 

<400> 5 

ctaggtgagg ccaagaagca a 21 



<210> 6 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide primer 

<400> 6 

ctgttcctct cagcagactg c 21 



<210> 7 
<211> 12 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> HPC4 tag sequence 
<400> 7 

Glu Asp Gin Val Asp Pro Arg Leu lie Asp Gly Lys 
15 10 



<210> 8 

<211> 3915 

<212> DNA 

<213> Homo sapiens 






<400> 8 
ggttttctcc 


gcgggcgcct 


cgggcggaac ctggagataa tgggcagcac ctgggggagc 


60 


cctggctggg 


tgcggctcgc 


tctttgcctg acgggcttag tgctctcgct ctacgcgctg 


120 


cacgtgaagg 


cggcgcgcgc 


ccgggaccgg gattaccgcg cgctctgcga cgtgggcacc 


180 


gccatcagct 


gttcgcgcgt 


cttctcctcc aggtgtgcac gggagtggga ggcgtggggc 


240 


ctcggagcag 


ggcggccagg 


atgccagatg attattctgg agtctgggat cggtgtgccc 


300 


ggggaacgga 


cacggggctg 


gactgctcgc ggggtcgttg cacaggggct gagctaccca 


360 


gcgatactgg 


tgttcgaaat 


aagagtgcga ggcaagggac cagacagtgc tggggactgg 


420 
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gattattccg gggactcgca 


cgtgaattgg 


atgccaagga 


ataacggtga 


ccaggaaagg 


4B0 


cggggaggca ggatggcggt 


agagattgac 


gatggtctca 


aggacggcgc 


gcaggtgaag 


540 


gggggtgttg gcgatggctg 


cgcccaggaa 


caaggtggcc 


cggtctggct 


gtgcgtgatg 


600 


gccaggcgtt agcataatga 


cggaatacag 


aggaggcgag 


tgagtggcca 


gggagctgga 


660 


gattctgggg tccagggcaa 


agataatctg 


cccccgactc 


ccagtctctg 


atgcaaaacc 


720 


gagtgaaccg ttataccagc 


cttgccattt 


taagaattac 


ttaagggccg 


ggcgcggtgg 


780 


cccactcctg taatcccagc 


actttgggag 


gccgaggcgg 


atggatcact 


tgaagtcagg 


840 


agttgaccag cctggccaac 


atggtgaaag 


cctgtctcta 


ccaaaaatag 


aaaaattaat 


900 


cgggcgctat ggcgggtgcc 


ttaatcccag 


ctactcgggg 


gggctaaggc 


aggagaatcg 


960 


cttgaacccg ggaggcggag 


gtttcagtga 


gccgagatcg 


cgccactgca 


ctccagcctg 


1020 


ggccagagtg agactccgtc 


tcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaag 


agacttactt 


1080 


aaggtctaag atgaaaagca 


gggcctacgg 


agtagccacg 


tccgggcctg 


gtctggggag 


1140 


aggggaggat agggtcagtg 


acatggaatc 


ctgacgtggc 


caaaggtgcc 


cggtgccagg 


1200 


agatcatcga cccttggact 


aggatgggag 


gtcggggaac 


agaggatagc 


ccaggtggct 


1260 


tcttggaaat cacctttctc 


gggcagggtc 


caaggcactg 


ggttgacagt 


cctaacctgg 


1320 


ttccacccca ccccacccct 


ctgccaggtg 


gggcaggggt 


ttcgggctgg 


tggagcatgt 


1380 


gctgggacag gacagcatcc 


tcaatcaatc 


caacagcata 


ttcggttgca 


tcttctacac 


1440 


actacagcta ttgttaggtg 


agtggctccg 


ccccctccct 


gcccgccccg 


ccccgcccct 


1500 


catccccctt ggtcagctca 


gccccactcc 


atgcaatctt 


ggtgatccac 


acagctgaca 


1560 


gccagctagc tgctcatcac 


ggagcgtcct 


gcgggtgggg 


atgtggggag 


gtaactaaca 


1620 


ggagtctttt aattggttta 


agtactgtta 


gaggctgaag 


ggcccttaaa 


gacatcctag 


1680 


gtccccaggt tttttgtttg 


ttgttgtttt 


gagacagggt 


ctggctctgt 


tgcccaaagt 


1740 


gaggtctagg atgcccttag 


tgtgcactgg 


cgtgatctca 


gttcatggca 


acctctgcct 


1800 


ccctgcccaa gggatcctcc 


caccttagcc 


tcccaagcag 


ctggaatcac 


aggcgtgcac 


1860 


cactatgccc agctaatttt 


tgtttttgtt 


tttttttggt 


agagatggtg 


tctcgccatg 


1920 


ttgcccaggc tggtctcaag 


caatctgtct 


gcctcagcct 


cccaaagtgc 


tggggggatt 


1980 


acaggcgtga gctaccatgc 


cccaccaaca 


ccccagtttt 


gtggaaaaga 


tgccgaaatt 


2040 


cctttttaag gagaagctga 


gcatgagcta 


tcttttgtct 


catttagtgc 


tcagcaggaa 


2100 


aatttgtatc tagtcccata 


agaacagaga 


gaggaaccaa 


gggagtggaa 


gacgatggcg 


2160 


ccccaggcct tgctgatgcc 


atatgccgga 


gatgagacta 


tccattacca 


cccttcccag 


2220 
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caggctccca cgctcccttt gagtcaccct tcccagctcc agagaaggca tcactgaggg 2280 

aggcccagca ccatggtcct ggctgacaca tggttcagac ttggccgatt tatttaagaa 2340 

attttattgc tcagaacttt ccctccctgg gcaatggcaa gagcttcaga gaccagtccc 2400 

ttggagggga cctgttgaag ccttcttttt tttttttttt aagaaataat cttgctctgt 2460 

tgcccaggct ggagtgcagt ggcacaatca tagctcactg taacctggct caagcgatcc 2520 

tcctgagtag ctaggactat aggcatgtca ctgcacccag ctaatttttt tttttttttt 2580 

tttttttttt ttgcgacata gtctcgctct gtcaccaggc tggagtgcag tggcacgatc 2640 

ttggctcact gcaacctctg cctcccgggt tcaagcaatt ttcctgcctc agcctcctga 2700 

gtagctggga ctacaggcgc gtgtcaccac gcccagctaa tttttgtatt tttagtggag 2760 

acagggtttc accatgttgg ctaggatggt ctcaatctct tgacctggtg atccatccgc 2820 

cttggcctcc caaagtgcta ggattacagg cgtgagtcaa cctcaccggg catttttttt 2880 

ttgagacgaa gtcttgctct tgctgcccaa gctggaatgt ggtggcatga tctcggctca 2940 

ctgcaacctc cacctcctag gttcaagcga ttctccacct tagcctcccc agcagctggg 3000 

attacaggtg cccatcaaca cacccggcta atttttgtat ttttattaga gatggggttt 3060 

tgccatgttg gccaggctgc tctcgaactc ctaacctcag gtgatccacc cccattggcc 3120 

tcccaaaata ctgggattac aggcatgagc caccgtgccc agctgaattt ctaaattttt 3180 

gatagagatc gggtctttct atgttgccca agctggtctt gaactcctag cctaaagcag 3240 

tcttcccacc tcggcctccc agagtgtttg gaatacgtgc gtaagccacc acatctgccc 3300 

tggagcctct tgttttagag acccttccca gcagctcctg gcatctaggt agtgcagtga 3360 

catcatggag tgttcgggag gtggccagtg cctgaagccc acaccggacc ctcttctgcc 3420 

ttgcaggttg cctgcggaca cgctgggcct ctgtcctgat gctgctgagc tccctggtgt 3480 

ctctcgctgg ttctgtctac ctggcctgga tcctgttctt cgtgctctat gatttctgca 3540 

ttgtttgtat caccacctat gctatcaacg tgagcctgat gtggctcagt ttccggaagg 3600 

tccaagaacc ccagggcaag gctaagaggc actgagccct caacccaagc caggctgacc 3660 

tcatctgctt tgctttggca tgtgagcctt gcctaagggg gcatatctgg gtccctagaa 3720 

ggccctagat gtggggcttc tagattaccc cctcctcctg ccatacccgc acatgacaat 3780 

ggaccaaatg tgccacacgc tcgctctttt ttacacccag tgcctctgac tctgtcccca 3840 

tgggctggtc tccaaagctc tttccattgc ccagggaggg aaggttctga gcaataaagt 3900 

ttcttagatc aatca 3915 

<210> 9 
<211> 806 
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<212> DNA 

<2l3> Homo sapiens 



<220> 

<221> CDS 

<222> (48).. (536) 

<400> 9 

ggcacgaggg ttttctccgc gggcgcctcg ggcggaacct ggagata atg ggc age 56 

Met Gly Ser 
1 

acc tgg ggg age cct ggc tgg gtg egg etc get ctt tgc ctg acg ggc 104 
Thr Trp Gly Ser Pro Gly Trp Val Arg Leu Ala Leu Cys Leu Thr Gly 
5 10 15 

tta gtg etc teg etc tac gcg ctg cac gtg aag gcg gcg cgc gee egg 152 
Leu Val Leu Ser Leu Tyr Ala Leu His Val Lys Ala Ala Arg Ala Arg 
20 25 30 35 

gac egg gat tac cgc gcg etc tgc gac gtg ggc acc gee ate age tgt 200 
Asp Arg Asp Tyr Arg Ala Leu Cys Asp Val Gly Thr Ala lie Ser Cys 
40 45 50 

teg cgc gtc ttc tec tec agg tgg ggc agg ggt ttc ggg ctg gtg gag 248 
Ser Arg Val Phe Ser Ser Arg Trp Gly Arg Gly Phe Gly Leu Val Glu 
55 60 65 

cat gtg ctg gga cag gac age ate etc aat caa tec aac age ata ttc 296 
His Val Leu Gly Gin Asp Ser lie Leu Asn Gin Ser Asn Ser lie Phe 
70 75 80 

ggt tgc ate ttc tac aca eta cag eta ttg tta ggt tgc ctg egg aca 344 
Gly Cys lie Phe Tyr Thr Leu Gin Leu Leu Leu Gly Cys Leu Arg Thr 
85 90 95 

cgc tgg gee tct gtc ctg atg ctg ctg age tec ctg gtg tct etc get 392 
Arg Trp Ala Ser Val Leu Met Leu Leu Ser Ser Leu Val Ser Leu Ala 
100 105 110 115 

ggt tct gtc tac ctg gee tgg ate ctg ttc ttc gtg etc tat gat ttc 440 
Gly Ser Val Tyr Leu Ala Trp He Leu Phe Phe Val Leu Tyr Asp Phe 
120 125 130 

tgc att gtt tgt ate acc acc tat get ate aac gtg age ctg atg tgg 488 
Cys He Val Cys He Thr Thr Tyr Ala He Asn Val Ser Leu Met Trp 
135 140 145 

etc agt ttc egg aag gtc caa gaa ccc cag ggc aag get aag agg cac 536 
Leu Ser Phe Arg Lys Val Gin Glu Pro Gin Gly Lys Ala Lys Arg His 
150 155 160 

tgagccctca acccaagcca ggctgacctc atetgetttg ctttggcatg tgagccttgc 596 

ctaagggggc atatctgggt ccctagaagg ccctagatgt ggggcttcta gattaccccc 656 

tcctcctgcc atacccgcac atgacaatgg accaaatgtg ccacacgctc gctctttttt 716 

acacccagtg cctctgactc tgtccccatg ggctggtctc caaagctctt tccattgccc 776 
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agggagggaa ggttctgagc aataaagttt 806 



<210> 10 

<211> 163 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Gly Ser Thr Trp Gly Ser Pro Gly Trp Val Arg Leu Ala Leu Cys 
15 10 15 



Leu Thr Gly Leu Val Leu Ser Leu Tyr Ala Leu His Val Lys Ala Ala 
20 25 30 



Arg Ala Arg Asp Arg Asp Tyr Arg Ala Leu Cys Asp Val Gly Thr Ala 
35 40 45 



lie Ser Cys Ser Arg Val Phe Ser Ser Arg Trp Gly Arg Gly Phe Gly 
50 55 60 



Leu Val Glu His Val Leu Gly Gin Asp Ser lie Leu Asn Gin Ser Asn 
65 70 75 80 



Ser lie Phe Gly Cys lie Phe Tyr Thr Leu Gin Leu Leu Leu Gly Cys 
85 90 95 



Leu Arg Thr Arg Trp Ala Ser Val Leu Met Leu Leu Ser Ser Leu Val 
100 105 110 



Ser Leu Ala Gly Ser Val Tyr Leu Ala Trp lie Leu Phe Phe Val Leu 
115 120 125 



Tyr Asp Phe Cys lie Val Cys lie Thr Thr Tyr Ala He Asn Val Ser 
130 135 140 



Leu Met Trp Leu Ser Phe Arg Lys Val Gin Glu Pro Gin Gly Lys Ala 
145 150 155 160 



Lys Arg His 



<210> 11 

<211> 5915 

<212> DNA 

<213> Homo sapiens 

<400> 11 

caeca tcaga tgggaegtet gtgaaggaga gacctcatct ggcccacagc ttggaaagga 60 
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gagactgact 


gttgagttga 


tgcaagctca 


ggtgttgcca 


ggcgggcgcc 


atgatagtag 


120 


agaggttagg 


atactgtcaa 


gggtgtgtgt 


ggccaaagga 


gtggttctgt 


gaatgtatgg 


180 


gagaaaggga 


gaccgaccac 


caggaagcac 


tggtgaggca 


ggacccggga 


ggatgggagg 


240 


ctgcagcccg 


aatggtgcct 


gaaatagttt 


caggggaaat 


gcttggttcc 


cgaatcggat 


300 


cgccgtattc 


gctggatccc 


ctgatccgct 


ggtctctagg 


tcccggatgc 


tgcaattctt 


360 


acaacaggac 


ttggcatagg 


gtaagcgcaa 


atgctgttaa 


ccacactaac 


acactttttt 


420 


ttttcttttt 


tttttttgag 


acagagtctc 


actctgtcgg 


cctggctgga 


gtgcagtggc 


480 


acgatctcgg 


ctcactgcaa 


cctccggctc 


cccggctcaa 


gcaattctcc 


tgcctcagcc 


540 


tcccgagtag 


ctgggattac 


aggcatgtgc 


caccacgccc 


ggctaatttt 


tgtattttta 


600 


gttgagatgg 


ggtttcacca 


tgttggcgag 


gctggtcttg 


aactcctgac 


ctcaggtaat 


660 


ccgccagcct 


cggcctccca 


aagtgctggg 


attacaagcg 


tgagccaccg 


tgcccggcca 


720 


acagttttta 


aatctgtgga 


gacttcattt 


cccttgatgc 


cttgcagccg 


cgccgactac 


780 


aactcccatc 


atgcctggca 


gccgctgggg 


ccgcgattcc 


gcacgtccct 


tacccgcttc 


840 


actagtcccg 


gcattcttcg 


ctgttttcct 


aactcgcccg 


cttgactagc 


gccctggaac 


900 


agccatttgg 


gtcgtggagt 


gcgagcacgg 


ccggccaatc 


gccgagtcag 


agggccagga 


960 


ggggcgcggc 


cattcgccgc 


ccggcccctg 


ctccgtggct 


ggttttctcc 


gcgggcgcct 


1020 


cgggcggaac 


ctggagataa 


tgggcagcac 


ctgggggagc 


cctggctggg 


tgcggctcgc 


1080 


tctttgcctg 


acgggcttag 


tgctctcgct 


ctacgcgctg 


cacgtgaagg 


cggcgcgcgc 


1140 


ccgggaccgg 


gattaccgcg 


cgctctgcga 


cgtgggcacc 


gccatcagct 


gttcgcgcgt 


1200 


cttctcctcc 


aggtgtgcac 


gggagtggga 


ggcgtggggc 


ctcggagcag 


ggcggccagg 


1260 


atgccagatg 


attattctgg 


agtctgggat 


cggtgtgccc 


ggggaacgga 


cacggggctg 


1320 


gactgctcgc 


ggggtcgttg 


cacaggggct 


gagctaccca 


gcgatactgg 


tgttcgaaat 


1380 


aagagtgcga 


ggcaagggac 


cagacagtgc 


tggggactgg 


gattattccg 


gggactcgca 


1440 


cgtgaattgg 


atgccaagga 


ataacggtga 


ccaggaaagg 


cggggaggca 


ggatggcggt 


1500 


agagattgac 


gatggtctca 


aggacggcgc 


gcaggtgaag 


gggggtgttg 


gcgatggctg 


1560 


cgcccaggaa 


caaggtggcc 


cggtctggct 


gtgcgtgatg 


gccaggcgtt 


agcataatga 


1620 


cggaatacag 


aggaggcgag 


tgagtggcca 


gggagctgga 


gattctgggg 


tccagggcaa 


1680 


agataatctg 


cccccgactc 


ccagtctctg 


atgcaaaacc 


gagtgaaccg 


ttataccagc 


1740 


cttgccattt 


taagaattac 


ttaagggccg 


ggcgcggtgg 


cccactcctg 


taatcccagc 


1800 


actttgggag 


gccgaggcgg 


atggatcact 


tgaagtcagg 


agttgaccag 


cctggccaac 


1860 
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atggtgaaag 


i-t r-% t~ rr t~ /**^ /*»+* is 
CCLy LCtCtd 




aaaadttddL 




ggcgggcgcc 




utaatcccag 


ctacucyygg 


gggctaaggc 


aggagaa t eg 


cccgaacccg 


ggaggeggag 




4™ 1 4* 4* *m m /to^/to ^ 

gt c c cag tga 


gccgagaccg 


cgccac ugca 


ccccagcL tg 


ggccagagcg 


ayactccytc 




LCdaaclaaaa 


aaaaaaaaaa 

aaaaadaaaa 


aaaaaaaaan 
aoiciciciciciaeig 


dy cie l. l. ct w u u 


a a^r^+" a an 

aaygtcuciag 


a ^r»a a a arrr*£a 
auyaaaayca 




gggccuacgg 


aycagccacy 


tccyggccty 


geceggggag 


a sagg a 9a ac 


aggg ucagcg 




acacggaacc 


c tgacgcggc 


caaagy ugee 


eggegecagg 


a f*t a ♦* n a a 

ay accdccya 


cccctyydct 




aggatgggag 


gtcggggaac 


agaggatagc 


c c agg c ggc u 


ccc tggaaac 


^ 4" 4" 4™ 4~ 




gggcagggtc 


caaggcactg 


ggttgacagt 


cctaacctgg 


4 s 4 s " y-« m 

ccccacccca 


^N* A y*% ^ *5% M ^»4* 

ccccaccccc 




ctgccaggtg 


gggcaggggt 


ttcgggctgg 


tggagcatgt 


geegggacag 


gacagcatcc 




tcaatcaatc 


caacagcata 


ttcggttgca 




accacagcca 


4* ^ 4— 4V «^ 4" 

ccgccaggcg 


*£4b U 


agtggctccg 


CCCCCtCCCt 


gcccgccccg 


ccccgcccct 


j>>« ^ 4* ^ m 4— 4— 

cacccccctt 


yy wCayCuCa 




gccccactcc 


atgcaatctt 


ggtgatccac 


acagctgaca 


gccagctagc 


cycucatcac 


O C Q A 
Z DO U 


ggagcgtcct 


gcgggtgggg 


atgtggggag 


gtaactaaca 


+*m 4W m 4-4-4-4— 

ggagccccct 


aaccggucca 


Zb4 U 


agtactgtta 


gagg c t ga ag 


ggcccttaaa 


gacaccccag 


gcccccaggc 


tutccgcc eg 




ttgttgtttx 


gagacagggt 


ccggccccgu 


tgcccaaagc 


gaggtccagg 


acgccctcag 




tgtgcactgg 


cgtgatctca 


gttcatggca 


acctctgcct 


ccctgcccaa 


gggatcctcc 


2820 


caccttagcc 


tcccaagcag 


ctggaatcac 


aggcgcgcac 


caccacgccc 


agccaaucc u 


2ooU 


tgtucxtgtt 


t lull tuggc 


agagatggtg 


CCLCyccacy 


ucgcccaggc 


tggtctcaag 




caatctgtct 


gcctcagcct 


cccaaagtgc 




acagg egega 


gctaccatgc 


JUUO 


cccaccaaca 


ccccagctxc 


geggaaaaga 


cgccgaaacc 


ccLLLLuaay 


gagaagc cga 


men 


gcatgagcta 


CCttttgtCt 


catxtagege 


ccagcaggaa 


f5 f5 V I- ^^f hm ^ 4* 

datutytaCC 


tagtcccata 


lion 


agaacagaga 


gaggaaccaa 


ggg a gtgg aa 


gaegaeggeg 


ccccagyccc 


tgccgaugcc 




atatgccgga 


gacgagacca 


CCCattdCCa 


cccuucccag 


caggcuccca 


/kj/Y/f 4- at-* .ft 4* 4- 4- 

CyCtCCCutt 


"7 o An 
oZ4U 


gagtcaccc u 


ccccagcccc 


agagaa.yy ca 


tcac cy ay y g 


»yy cccayca 


ccacyytccu 




ggccgacaca 


egg cue agac 




♦* a ♦*♦*♦■ a a rra a 

tacttaayaa 


dLLLLdLLyv. 


*• ^ a na a *• ^ 

tcdyaacLLt 




ccctccctgg 


geaaeggcaa 


gage ll cay a 


gaccaguccc 


ccggagggga 


cccgccgaag 




CCCtCttLLt 


t-t-*-t-t-t-t-H-*- 
LLLttLLttC 


si a /"wa a a a a 

aagaadLddL 


CLLyCCCCyu 


uy cceagy c t 


ggagcgcagi: 


i a Q n 


ggcacaatca 


tagctcactg 


taacctggct 


caagcgatcc 


tcctgagtag 


ctaggactat 


3540 


aggcatgtca 


ctgcacccag 


ctaatttttt 


tttttttttt 


tttttttttt 


ttgegacata 


3600 


gtctcgctct 


gtcaccaggc 


tggagtgcag 


tggcacgatc 


ttggctcact 


gcaacctctg 


3660 


cctcccgggt 


tcaagcaatt 


ttcctgcctc 


agcctcctga 


gtagctggga 


ctacaggcgc 


3720 
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gtgtcaccac 


gcccagctaa 


tttttgtatt 


tttagtggag 


acagggtttc 


accatgttgg 


3780 


ctaggatggt 


ctcaatctct 


tgacctggtg 


atccatccgc 


cttggcctcc 


caaagtgcta 


3840 


ggattacagg 


cgtgagtcaa 


cctcaccggg 


catttttttt 


ttgagacgaa 


gtcttgctct 


3900 


tgctgcccaa 


gctggaatgt 


ggtggcatga 


tctcggctca 


ctgcaacctc 


cacctcctag 


3960 


gttcaagcga 


ttctccacct 


tagcctcccc 


agcagctggg 


attacaggtg 


cccatcaaca 


4020 


cacccggcta 


atttttgtat 


ttttattaga 


gatggggttt 


tgccatgttg 


gccaggctgc 


4080 


tctcgaactc 


ctaacctcag 


gtgatccacc 


cccattggcc 


tcccaaaata 


ctgggattac 


4140 


aggcatgagc 


caccgtgccc 


agctgaattt 


ctaaattttt 


gatagagatc 


gggtctttct 


4200 


atgttgccca 


agctggtctt 


gaactcctag 


cctaaagcag 


tcttcccacc 


tcggcctccc 


4260 


agagtgtttg 


gaatacgtgc 


gtaagccacc 


acatctgccc 


tggagcctct 


tgttttagag 


4320 


acccttccca 


gcagctcctg 


gcatctaggt 


agtgcagtga 


catcatggag 


tgttcgggag 


4380 


gtggccagtg 


cctgaagccc 


acaccggacc 


ctcttctgcc 


ttgcaggttg 


cctgcggaca 


4440 


cgctgggcct 


ctgtcctgat 


gctgctgagc 


tccctggtgt 


ctctcgctgg 


ttctgtctac 


4500 


ctggcctgga 


tcctgttctt 


cgtgctctat 


gatttctgca 


ttgtttgtat 


caccacctat 


4560 


gctatcaacg 


tgagcctgat 


gtggctcagt 


ttccggaagg 


tccaagaacc 


ccagggcaag 


4620 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc 


tcatctgctt 


tgctttggca 


46B0 


tgtgagcctt 


gcctaagggg 


gcatatctgg 


gtccctagaa 


ggccctagat 


gtggggcttc 


4740 


tagattaccc 


cctcctcctg 


ccatacccgc 


acatgacaat 


ggaccaaatg 


tgccacacgc 


4800 


tcgctctttt 


ttacacccag 


tgcctctgac 


tctgtcccca 


tgggctggtc 


tccaaagctc 


4860 


tttccattgc 


ccagggaggg 


aaggttctga 


gcaataaagt 


ttcttagatc 


aatcagccaa 


4920 


gtctgaacca 


tgtgtctgcc 


atggactgtg 


gtgctgggcc 


tccctcggtg 


ttgccttctc 


4980 


tggagctggg 


aagggtgagt 


cagagggaga 


gtggagggcc 


tgctgggaag 


ggtggttatg 


5040 


ggtagtctca 


tctccagtgt 


gtggagtcag 


caaggcctgg 


ggcaccattg 


gcccccaccc 


5100 


ccaggaaaca 


ggctggcagc 


tcgctcctgc 


tgcccacagg 


agccaggcct 


cctctcctgg 


5160 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct 


ggttctgtaa 


atgcttgctg 


5220 


ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg 


ccttatcgac 


cattccaagc 


5280 


cagtattggt 


agccttggag 


ggtcagggcc 


aggttgtgaa 


ggtttttgtt 


ttgcctatta 


5340 


tgccctgacc 


acttacctac 


atgccaagca 


ctgtttaaga 


acttgtgttg 


gcagggtgca 


5400 


gtggctcaca 


cctgtaatcc 


ctgtactttg 


ggaggccaag 


gcaggaggat 


cacttgaggc 


5460 


caggagttcc 


agaccagcct 


gggcaaaata 


gtgagacccc 


tgtctctaca 


aaaaaaaaaa 


5520 
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aaaaaaaaaa ttagccaggc 


atggcggtgt. 


acyCdCCtat agCCCCaaCL ddtcgyyady 




ccggcgggaa gactgcctga 




utyayycugu agcgagccau gatcactgca 


CC40 


ctccagcctg agcaacagag 




f- pr<a an 3 33 3 ra^a#"»33^333/"» 333333333/^ 

ICCaaaaaad aacaaddaaC clclaciciclcictciL; 




ttgtgttaac gtgttaaact 


cgtttaatct 


ttacagtgat ttatgaggtg ggtactatta 


5760 


ttatccctat cttgatgata 


gggacagagt 


ggctaattag tatgcctgag atcacacagc 


5820 


tactgcagga ggctctcagg 


atttgaatcc 


acctggtcca tctggctcca gcatctatat 


5880 


gctttttttt ttgttggttt 


gtttttgaga 


cggac 


5915 



<210> 12 

<211> 5915 

<212> DNA 

<213> Homo sapiens 



<400> 12 
caccatcaga 


tgggacgtct 


gtgaaggaga 


gacctcatct ggcccacagc ttggaaagga 


60 


gagactgact 


gttgagttga 


tgcaagctca 


ggtgttgcca ggcgggcgcc atgatagtag 


120 


agaggttagg 


atactgtcaa 


gggtgtgtgt 


ggccaaagga gtggttctgt gaatgtatgg 


180 


gagaaaggga 


gaccgaccac 


caggaagcac 


tggtgaggca ggacccggga ggatgggagg 


240 


ctgcagcccg 


aatggtgcct 


gaaatagttt 


caggggaaat gcttggttcc cgaatcggat 


300 


cgccgtattc 


gctggatccc 


ctgatccgct 


ggtctctagg tcccggatgc tgcaattctt 


360 


acaacaggac 


ttggcatagg 


gtaagcgcaa 


atgctgttaa ccacactaac acactttttt 


420 


ttttcttttt 


tttttttgag 


acagagtctc 


actctgtcgg cctggctgga gtgcagtggc 


480 


acgatctcgg 


ctcactgcaa 


cctccggctc 


cccggctcaa gcaattctcc tgcctcagcc 


540 


tcccgagtag 


ctgggattac 


aggcatgtgc 


caccacgccc ggctaatttt tgtattttta 


600 


gttgagatgg 


ggtttcacca 


tgttggcgag 


gctggtcttg aactcctgac ctcaggtaat 


660 


ccgccagcct 


cggcctccca 


aagtgctggg 


attacaagcg tgagccaccg tgcccggcca 


720 


acagttttta 


aatctgtgga 


gacttcattt 


cccttgatgc cttgcagccg cgccgactac 


780 


aactcccatc 


atgcctggca 


gccgctgggg 


ccgcgattcc gcacgtccct tacccgcttc 


840 


actagtcccg 


gcattcttcg 


ctgttttcct 


aactcgcccg cttgactagc gccctggaac 


900 


agccatttgg 


gtcgtggagt 


gcgagcacgg 


ccggccaatc gccgagtcag agggccagga 


960 


ggggcgcggc 


cattcgccgc 


ccggcccctg 


ctccgtggct ggttttctcc gcgggcgcct 


1020 


cgggcggaac 


ctggagataa 


tgggcagcac 


ctgggggagc cctggctggg tgcggctcgc 


1080 


tctttgcctg 


acgggcttag 


tgctctcgct 


ctacgcgctg cacgtgaagg cggcgcgcgc 


1140 


ccgggaccgg 


gattaccgcg 


cgctctgcga 


cgtgggcacc gccatcagct gttcgcgcgt 


1200 
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cttctcctcc aggcgugcac 


gggagcggga 


ggcgcggggc 


c c cggagcag 


ggeggecagg 


1 OfiO 
l*OU 


atgccagatg atcattctgg 


aguccgggac 


cggt-gcgcct. 


ggggaacgga 


cacgyggcuy 




gactgctcgc ggggtcgttg 


cacaggggcc 


gagccacccd 


gcgacactgg 


eg ttcydddL 


1 ISO 


aagagtgcga ggcaagggac 


c aga c ag cgc 


t9999 act 99 


gac cactLccg 


gggactcgea 




cgtgaattgg atgccaagga 


acaacggcga 


ccaggaaagy 


agg c a 


ggatggcggc 


1 500 


agagattgac gatggtctca 


aggacggcgc 


gcaggcgaag 


9s39g c g tt: 9 


gega egge eg 




cgcccaggaa caaggtggcc 


cggcccggcc 


gcgcgcgatg 


gccaggcgu t. 


agCdCdduya 




cggaatacag aggaggcgag 


tgagtggcca 


gggagctgga 


gattc c gggg 


u c cagggc aa 


If on 
lOtJU 


agataatctg cccccgactc 


ccagtctctg 


atgcaaaacc 


gagcgaac eg 


LLdLdCCa^C 


1 HAC\ 


cttgccattt taagaattac 


ttaagggccg 


ggcgcggtgg 


cccactcctg 


caaucccagc 


loQU 


actttgggag gccgaggcgg 


atggatcact 


tgaagtcagg 


ag 1 1 gaccag 


cctggccaac 


XobU 


atggtgaaag cctgtctcta 


ccaaaaatag 


aaaaattaat 


egggegctat 


ggcgggtgcc 




ttaatcccag ctactcgggg 


gggctaaggc 


aggagaatcg 


cttgaacccg 


g9 a 99 c Sr9 a 9 




gtttcagtga gccgagatcg 


cgccactgca 


ctccagcctg 


ggccagagtg 


agactccgtc 


2U4U 


tcaaaaaaaa aaaaaaaaaa 


aaaaaaaaag 


agacttactx 


aagg c c uaag 


9 ^T9 9 9 9 9 

augaaaagca 


*sJLUU 


gggcctacgg agtagccacg 


tccgggcctg 


gtctggggag 


*m*m*m*m J'B * ■ 9 ^ 

aggggaggat 


a 999 tca 9 t 9 




acatggaatc ctgacgtggc 


caaaggtgcc 


cggtgccagg 


agaccatcga 


cccccggacc 




aggatgggag gtcggggaac 


agaggatagc 


ccaggtggct 


tcttggaaat 


cacctccczc 




gggcagggtc caaggcactg 


ggttgacagt 


cctaacctgg 


ttccacccca 


ccccaccccc 


Old A 


ctgccaggtg gggcaggggt 


ttcgggctgg 


tggagcatgt 


gcugggacag 


gacagcaccc 




tcaatcaatc caacagcata 


ttcggttgca 


tcttccacac 


accacagcca 


ttgccaggcg 




agtggctccg ccccctccct 


gcccgccccg 


ccccgccccc 


CdtCCCCCtU 


yy uCdyCtCd 




gccccactcc atgcaatctt 


ggcgacccac 


dCdyCtyaCa 






2580 


cgagcgtcct gcgggtgggg 


atgtggggag 


guaaccaaca 


ggay tcttt t 


dattyy ttt* 




agtactgtta gaggctgaag 


ggccctcaaa 


gacatcctag 


g tccccayy t 


f t* h*t-rrt*t- tri 
tttttyLLty 


« / vv 


ttgttgtttt gagacagggt 


ctggctctgt 


tgcccaaagt 


gaggecuagg 


atyCCCLLay 


O*7U0 


tgtgcactgg cgtgatctca 


gttcatggca 


acctctgcct 


ccccgcccaa 


gggaccctcc 


ZD ZU 


caccttagcc tcccaagcag 


ctggaatcac 


aggcgtgcac 


cactatgccc 


agctaatttt 


2880 


tgtttttgtt tttttttggt 


agagatggtg 


tctcgccatg 


ttgcccaggc 


tggtctcaag 


2940 


caatctgtct gcctcagcct 


cccaaagtgc 


tggggggatt 


acaggcgtga 


gctaccatgc 


3000 


cccaccaaca ccccagtttt 


gtggaaaaga 


tgccgaaatt 


cctttttaag 


gagaagctga 


3060 
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gcatgagcta 


tcttttgtct 


catttagtgc 


tcagcaggaa 


aatttgtatc 


tagtcccata 


3120 


agaacagaga 


gaggaaccaa 


gggagtggaa 


gacgatggcg 


ccccaggcct 


tgctgatgcc 


3180 


atatgccgga 


gatgagacta 


tccattacca 


cccttcccag 


caggctccca 


cgctcccttt 


3240 


gagtcaccct 


tcccagctcc 


agagaaggca 


tcactgaggg 


aggcccagca 


ccatggtcct 


3300 


ggctgacaca 


tggttcagac 


ttggccgatt 


tatttaagaa 


attttattgc 


tcagaacttt 


3360 


ccctccctgg 


gcaatggcaa 


gagcttcaga 


gaccagtccc 


ttggagggga 


cctgttgaag 


3420 


ccttcttttt 


tttttttttt 


aagaaataat 


cttgctctgt 


tgcccaggct 


ggagtgcagt 


3480 


ggcacaatca 


tagctcactg 


taacctggct 


caagcgatcc 


tcctgagtag 


ctaggactat 


3540 


aggcatgtca 


ctgcacccag 


ctaatttttt 


tttttttttt 


tttttttttt 


ttgcgacata 


3600 


gtctcgctct 


gtcaccaggc 


tggagtgcag 


tggcacgatc 


ttggctcact 


gcaacctctg 


3660 


cctcccgggt 


tcaagcaatt 


ttcctgcctc 


agcctcctga 


gtagctggga 


ctacaggcgc 


3720 


gtgtcaccac 


gcccagctaa 


tttttgtatt 


tttagtggag 


acagggtttc 


accatgttgg 


3780 


ctaggatggt 


ctcaatctct 


tgacctggtg 


atccatccgc 


cttggcctcc 


caaagtgcta 


3840 


ggattacagg 


cgtgagtcaa 


cctcaccggg 


catttttttt 


ttgagacgaa 


gtcttgctct 


3900 


tgctgcccaa 


gctggaatgt 


ggtggcatga 


tctcggctca 


ctgcaacctc 


cacctcctag 


3960 


gttcaagcga 


ttctccacct 


tagcctcccc 


agcagctggg 


attacaggtg 


cccatcaaca 


4020 


cacccggcta 


atttttgtat 


ttttattaga 


gatggggttt 


tgccatgttg 


gccaggctgc 


4080 


tctcgaactc 


ctaacctcag 


gtgatccacc 


cccattggcc 


tcccaaaata 


ctgggattac 


4140 


aggcatgagc 


caccgtgccc 


agctgaattt 


ctaaattttt 


gatagagatc 


gggtCtttCt 


4200 


atgttgccca 


agctggtctt 


gaactcctag 


cctaaagcag 


tcttcccacc 


tcggcctccc 


4260 


agagtgtttg 


gaatacgtgc 


gtaagccacc 


acatctgccc 


tggagcctct 


tgttttagag 


4320 


acccttccca 


gcagctcctg 


gcatctaggt 


agtgcagtga 


catcatggag 


tgttcgggag 


4380 


gtggccagtg 


cctgaagccc 


acaccggacc 


ctcttctgcc 


ttgcaggttg 


cctgcggaca 


4440 


cgctgggcct 


ctgtcctgat 


gctgctgagc 


tccctggtgt 


ctctcgctgg 


ttctgtctac 


4500 


ctggcctgga 


tcctgttctt 


cgtgctctat 


gatttctgca 


ttgtttgtat 


caccacctat 


4560 


gctatcaacg 


tgagcctgat 


gtggctcagt 


ttccggaagg 


tccaagaacc 


ccagggcaag 


4620 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc 


tcatctgctt 


tgctttggca 


4680 


tgtgagcctt 


gcctaagggg 


gcatatctgg 


gtccctagaa 


ggccctagat 


gtggggcttc 


4740 


tagattaccc 


cctcctcctg 


ccatacccgc 


acatgacaat 


ggaccaaatg 


tgccacacgc 


4800 


tcgctctttt 


ttacacccag 


tgcctctgac 


tctgtcccca 


tgggctggtc 


tccaaagctc 


4860 
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tttccattgc 


ccagggaggg 


aaggttctga 


gcaataaagt ttcttagatc aatcagccaa 


4920 


gtctgaacca 


tgtgtctgcc 


acggactgtg 


gtgctgggcc 


tccctcggtg ttgccttctc 


a q a a 


tggagctggg 


aagggtgagt 


cagagggaga 


gtggagggcc 


tgctgggaag ggtggttatg 


504O 


ggtagtctca 


tctccagtgt 


gtggagtcag 


caaggcctgg ggcaccattg gcccccaccc 


CI AA 

5100 


ccaggaaaca 


ggctggcagc 


tcgctcctgc 


tgcccacagg agccaggcct cctctcctgg 


CI C A 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct ggttctgtaa atgcttgctg 


5220 


^ m ^ _ fa, A. 

ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg ccttatcgac cattccaagc 


c a A 
52 80 


cagtattggt 


agccttggag 


ggtcagggcc 


aggttgtgaa ggtttttgtt ttgcctatta 


5340 


tgccctgacc 


acttacctac 


atgccaagca 


ctgtttaaga 


acttgtgttg gcagggtgca 


5400 


gtggctcaca 


cctgtaatcc 


a ^ i_ m m fa A- fa. 

ctgtactttg 


ggaggccaag gcaggaggat cacttgaggc 


C A C A 

5460 


caggagttcc 


agaccagcct 


gggcaaaata 


gtgagacccc 


tgtctctaca aaaaaaaaaa 


5520 


aaaaaaaaaa 


ttagccaggc 


atggtggtgt 


atgtacctat 


agtcccaact aatcgggaag 


5580 


ctggcgggaa 


gactgcttga 


gcccagaagg 


ttgaggctgc 


agtgagccat gatcactgca 


5640 


ctccagcctg 


agcaacagag 


caagaccgtc 


tccaaaaaaa 


aacaaaaaac aaaaaaaaac 


5700 


ttgtgttaac 


gtgttaaact 


cgtttaatct 


ttacagtgat 


ttatgaggtg ggtactatta 


5760 


ttatccctat 


cttgatgata 


gggacagagt 


ggctaattag 


tatgcctgag atcacacagc 


5820 


tactgcagga 


ggctctcagg 


atttgaatcc 


acctggtcca 


tctggctcca gcatctatat 


5880 


gctttttttt 


ttgttggttt 


gtttttgaga 


cggac 




5915 



<210> 13 

<211> 5915 

<212> DNA 

<213> Homo sapiens 

<400> 13 



caccatcaga tgggacgtct 


gtgaaggaga 


gacctcatct ggcccacagc ttggaaagga 


60 


gagactgact gttgagttga 


tgcaagctca 


ggtgttgcca ggcgggcgcc atgatagtag 


120 


agaggttagg atactgtcaa 


gggtgtgtgt 


ggccaaagga gtggttctgt gaatgtatgg 


180 


gagaaaggga gaccgaccac 


caggaagcac 


tggtgaggca ggacccggga ggatgggagg 


240 


ctgcagcccg aatggtgcct 


gaaatagttt 


caggggaaat gcttggttcc cgaatcggat 


300 


cgccgtattc gctggatccc 


ctgatccgct 


ggtctctagg tcccggatgc tgcaattctt 


360 


acaacaggac ttggcatagg 


gtaagcgcaa 


atgctgttaa ccacactaac acactttttt 


420 


ttttcttttt tttttttgag 


acagagtctc 


actctgtcgg cctggctgga gtgcagtggc 


480 


acgatctcgg ctcactgcaa 


cctccggctc 


cccggctcaa gcaattctcc tgcctcagcc 


540 
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tcccgagtag 




annpahnf np 

dyycdty tgc 




aac t a a h t* t* fr- 


t-rrt* a ** t- 1- 1* t* a 

ty tdt 1 1 ttd 


D UU 


gttgagatgg 


rrpt* tp pa PPa 
yytttCaCta 


t"n 1" t"PPPpaP 

tyt ty y eg ay 


rr r» era t c 1 1 cr 
yctyyuuuLy 


aa\>u^v> Lydt 


ph pappt" aat* 
t ttdyy tdd t 


DOW 


P /"*P /ina p P T~ 

ccgccaycct 


cyycctccca 


day ty t ty y y 


aft* acaacrccr 




f PP PPPT ptp pa 

ty ttvyy ttd 


720 




a a ^ rr ^ ppa 
aatttytyya 




rrrttaatoc 


p« t~ fcorr*apTP*p , pi 




780 


aactcccatc 


a t* n f+ ^ si rr a 

atycctyyca 


ft PPflP^pj firm 

gccgccgggg 


prrrroa t* rr 
c% v» 


ptp" a p*prf* g*r*r*t" 
y wa^y utttv 


tdtt ty 1 1 tt 


840 


act. ay tcccg 


ppa ♦*> t*> pt" t" pp 
yCattCtLCy 


r» rr t- 1 - 1* t* <^ o h 

L.y LL.L ttUL 






pyprnhofiaar 
y ttt uyyaac 


900 
^ \j \j 


agccatt tgg 


g tcgtggagt 


ppnanpa r*cm 

gegagcaegg 


ppftnppaahr 
ccy y wwdd 


ptp* fcrapwT* pan 

yttyaytcay 


a PTPTpppanrra 
dyyyucdyyd 


960 


ggggcgcggc 


cat tcgccgc 


ccyycccctg 


ctccy tyytt 


ptptt* t" hhpfpp 

yyttttctcc 


/-T /-1/-T PTPP"P/PP t" 

gey y y wy ttt 


1020 


cgggcggaac 


c tggaga taa 


^pjpjnparrp'a f 

tgyy ceiy Cdc 


ctgggggagc 




t* fTortPTP" t* rnr 
t y ty y 1 1 ty t 


1080 


tCtLLyCCty 


a w ici p /~» V r* a p 

acgggcttag 


tyCtctcyct 


ctotytytty 


pa opt t* pra a ptpt 
LaLy Lyaayy 


PfWPCT Pfl PPP 

ty y ty ty ty 


1140 


ccgggaccgg 


gatxaccgcg 


cgc u c eg cga 


Cy tyyy cacc 


ptp pa ♦* pa ptpt* 

yccdttayct 


prt" t* pp/ppTPPrt* 
y t tcy cy cy t 


1 900 


cttctcctcc 


aggtgtgcac 


ptptp* a pt rmri a 

gggagtggga 


pt rr p* rr pt Pt ptpt e* 

ggegegggge 


pt" ppprapTpapr 

c t. cyyaycay 


ptptppp/p pa PfT 

ggeggecagg 


I960 


atgccagatg 


attat tc egg 


age c eggga z 


cgguguyccc 


ptpt ptpt a a p ptpt a 

ggggaacgga 


pa p< i/innnf p 

caegggge tg 




gactgctcgc 


ggggtcgttg 


cacaggggct. 


yayccaccca 


pp; a fp a p pp 

ycyatactyy 


♦~ p t* t* ppa a a t~ 

ty t tcydddt 




aagagtgcga 


gg caagggac 


cagacagcgc 


c gggg act gg 


<r a T~ T* a T* T* PPPT 

y at cat t ccy 


pppa pt* pppa 

y gy a c t cyca 


i a a n 


cgtgaaccgg 


atgccaagga 


a V 5» a /t/n'i H Pr a 

ataaeggega 


a pm a a a wm 

CCayyaaayy 


p^yp/pp a # ipt p a 

c gy gg & gg c ° 


nrta t"ppppp+* 

ggatggcggc 


1 -^00 


agagattgac 


gatggcctca 


aggaeggege 


<^p» a ^ V* a a P<r 

geaggegaag 


PP*T/-TP T* P T" T* PT 

ggggg^g^^g 


y Cy d t gg c t y 


1 ^60 


cgcccaggaa 


caaggtggcc 


eggtc egget 


gugegugaty 


pppa PPPPT* t* 

yccayycytt 


a ppa haa Hrta 
dy td tddtyd 


1 690 


cggaatacag 


aggaggcgag 


tgagcggccd 


rtn pt a ^p»T*PTPra 

gggagccgga 


pa t~ T" p f" ppppt 

yat t c ty y y y 


h ppa ppp p a a 
tttdyyy tdd 


1 6R0 


agacaacccg 


cccccy ac LC 


p^ p» apf p» 4" p» t" pt 

ccay tctcty 


a T - pTP*a a a a p^p* 
duyuddddtt 


paprf*paappPT 

yaytyaaccy 


r* t" a t"a ppapp 
t td uaLUdy t 


1740 


Ml>f a ♦» 4- t" 

CLLyccattL 


t* a a p a a t* t~ a p 

LaayaattaC 


ffa apTPTPfp*p , PT 

ttaayyyccy 


PTPT P»P« P*PTPT f" PlfT 

ggegeggegg 


pppa pt* ppf er 
tttdttttuy 


b>dd tttwdy w 


1 flOO 


a ^ t - t~ t* /t/t/t a p 

acct tgggag 


y ccgdygcgy 


a ♦"pTPta +• p»a p«f- 
dLyydLLdLL 


r*pya a pti* p»a pi f i 

tyaaytcayy 


apt - t*na ppafr 
dy t ty dtwdy 


pp*- ptpt ppa a p 


1860 


atgg tgaaag 


p p fr- f-yf- p|- pt" a 


ttaddad tdy 


ddddd w lad L> 


ppTPTPf ppyp hah 
cyyy tyc tdt 


ppiPTrrrrrhPTPP 
ygtyyytytt 


1920 

J> J A W 




cuatucyyyy 


y y y v* u nay y 


aeroaoaa r* ptt 
dy ydy dd i»v»y 


t ty dd\»i>*v*y 


crcrarrpjccrcTapT 
yydyy tyydy 


1980 


pt- t* h* porrhrra 

y ttt cag tya 


*-» ^ ppa rra t* pp 
yccydydtcy 


tycedctytd 


p , t"p , p*aPTP»r<t"PT 
cuctayttuy 


yycvayay ty 


a pa pfpprrh P 
dydvu^uy ut 


2040 


♦* pa a a a a a a a 


aaaaaaaaaa 
a add d ddddd 


aaaaaaaaarr 
dddddddddy 


dydtttdCtu 


a a PTPT t" ptaafl 

day y u uaay 


ct uy cto g ay »»a 


2100 


gggcctacgg 


a arrpra pp 

aytayccacy 


w p)/wtw rrp p*^ pt 

c ceggy c c t-y 


Pf ♦* P» PTPTPf PT a PT 

geceggggag 


a ppiPTPa pp a f 

aggggaggac 


a pppt" pa pt* p 

dyy y t toy ty 


91 60 


acatggaatc 


ctgacgtggc 


caaaggtgee 


eggtgecagg 


agatcatcga 


cccttggact 


2220 


aggatgggag 


gtcggggaac 


agaggatagc 


ccaggtggct 


tcttggaaat 


cacctttctc 


2280 


gggcagggtc 


caaggcactg 


ggttgacagt 


cctaacctgg 


ttccacccca 


ccccacccct 


2340 


ctgecaggtg 


gggcaggggt 


ttcgggctgg 


tggagcatgt 


gctgggacag 


gacagcatcc 


2400 
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tcaatcaatc 


caacagcata 


ttcggttgca 


tcttctacac 


actacagcta 


ttgttaggtg 


2460 


agtggctccg 


ccccctccct 


gcccgccccg 


ccccgcccct 


catccccctt 


ggtcagctca 


2520 


gccccactcc 


atgcaatctt 


ggtgatccac 


acagctgaca 


gccagctagc 


tgctcatcac 


2580 


ggagcgtcct 


gcgggtgggg 


atgtggggag 


gtaactaaca 


ggagtctttt 


aattggttta 


2640 


agtactgtta 


gaggctgaag 


ggcccttaaa 


gacatcctag 


gtccccaggt 


tttttgtttg 


2700 


ttgttgtttt 


gagacagggt 


ctggctctgt 


tgcccaaagt 


gaggtctagg 


atgcccttag 


2760 


tgtgcactgg 


cgtgatctca 


gttcatggca 


acctctgcct 


ccctgcccaa 


gggatcctcc 


2820 


caccttagcc 


tcccaagcag 


ctggaatcac 


aggcgtgcac 


cactatgccc 


agctaatttt 


2880 


tgtttttgtt 


tttttttggt 


agagatggtg 


tctcgccatg 


ttgcccaggc 


tggtctcaag 


2940 


caatctgtct 


gcctcagcct 


cccaaagtgc 


tggggggatt 


acaggcgtga 


gctaccatgc 


3000 


cccaccaaca 


ccccagtttt 


gtggaaaaga 


tgccgaaatt 


cctttttaag 


gagaagctga 


3060 


gcatgagcta 


tcttttgtct 


catttagtgc 


tcagcaggaa 


aatttgtatc 


tagtcccata 


3120 


agaacagaga 


gaggaaccaa 


gggagtggaa 


gacgatggcg 


ccccaggcct 


tgctgatgcc 


3180 


atatgccgga 


gatgagacta 


tccattacca 


cccttcccag 


caggctccca 


cgctcccttt 


3240 


gagtcaccct 


tcccagctcc 


agagaaggca 


tcactgaggg 


aggcccagca 


ccacggtcct 


3300 


ggctgacaca 


tggttcagac 


ttggccgatt 


tatttaagaa 


attttattgc 


tcagaacttt 


3360 


ccctccctgg 


gcaatggcaa 


gagcttcaga 


gaccagtccc 


ttggagggga 


cctgttgaag 


3420 


ccttcttttt 


tttttttttt 


aagaaataat 


cttgctctgt 


tgcccaggct 


ggagtgcagt 


3480 


ggcacaatca 


tagctcactg 


taacctggct 


caagcgatcc 


tcctgagtag 


ctaggactat 


3540 


aggcatgtca 


ctgcacccag 


ctaatttttt 


tttttttttt 


tttttttttt 


ttgcgacata 


3600 


gtctcgctct 


gtcaccaggc 


tggagtgcag 


tggcacgatc 


ttggctcact 


gcaacctctg 


3660 


cctcccgggt 


tcaagcaatt 


ttcctgcctc 


agcctcctga 


gtagctggga 


ctacaggcgc 


3720 


gtgtcaccac 


gcccagctaa 


tttttgtatt 


tttagtggag 


acagggtttc 


accatgttgg 


3780 


ctaggatggt 


ctcaatctct 


tgacctggtg 


atccatccgc 


cttggcctcc 


caaagtgcta 


3840 


ggattacagg 


cgtgagtcaa 


cctcaccggg 


catttttttt 


ttgagacgaa 


gtcttgctct 


3900 


tgctgcccaa 


gctggaatgt 


ggtggcatga 


tctcggctca 


ctgcaacctc 


cacctcctag 


3960 


gttcaagcga 


ttctccacct 


tagcctcccc 


agcagctggg 


attacaggtg 


cccatcaaca 


4020 


cacccggcta 


atttttgtat 


ttttattaga 


gatggggttt 


tgccatgttg 


gccaggctgc 


4080 


tctcgaactc 


ctaacctcag 


gtgatccacc 


cccattggcc 


tcccaaaata 


ctgggattac 


4140 


aggcatgagc 


caccgtgccc 


agctgaattt 


ctaaattttt 


gatagagatc 


gggtctttct 


4200 
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atgttgccca 


agctggcctt 


/T aar^^^*Mfa>«« .» 

gaacccctag 


cccaaagcag tcctcccacc ccggcctccc 




agagtgtttg 


gaatacgtgc 


gcaagccacc 


acatccgccc cggagccLCt cgtuucagag 




acccttccca 


gcagctcctg 


gcatctaggt 


agtgcagtga catcatggag tgttcgggag 


/"ion 


gtggccagtg 


cctgaagccc 


acaccggacc 


ctctxctgcc ccgcaggctg cccgcggaca 




cgctgggcct 


ctgtcctgat 


gctgctgagc 


V V* i'irt ¥~ ft ^ ^ 4^ r+ ^ sr^r ^ 4» ^» V" a ^ 

tCCCuggtgt CLctcyccyg ctcty tccac 




ctggcctgga 


tcctgtcctt 


cgtgctctat 


gacctctgca tty tctytac caccaccuau 




gctatcaacg 


tgagcctgat 


gtggctcagt 


ttccggaagg tccaagaacc ccagggcaag 


A CO ft 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc tcatctgctt tgctttggca 


4 £T Q ft 


tgtgagcctt 


gcctaagggg 


gcatatctgg 


gtccctagaa ggccccagat gcggggccuc 


A T Aft 


tagattaccc 


cctcctcctg 


ccatacccgc 


acacgacaac ggaccaaacg cgccacacgc 


A Q ft ft 


tcgctctttt 


ttacacccag 


tgcctctgac 


tctgtcccca tgggctggtc tccaaagctc 


A AC ft 


tttccattgc 


ccagggaggg 


aaggttctga 


gcaataaagt ttcttagatc aatcagccaa 


A Of) ft 


gtctgaacca 


tgtgtctgcc 


atggactgtg 


gtgctgggcc tccctcggtg ttgccttctc 




tggagctggg 


aagggtgagt 


cagagggaga 


gtggagggcc tgctgggaag ggtggttatg 


C ft Vt ft 

5040 


ggtagtctca 


tctccagtgt 


gtggagtcag 


caaggcctgg ggcaccattg gcccccaccc 


5100 


ccaggaaaca 


ggctggcagc 


tcgctcctgc 


tgcccacagg agccaggcct cctctcctgg 


SloO 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct ggttctgtaa atgcttgctg 


5220 


ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg ccttatcgac cattccaagc 


52UU 


cagtattggt 


ag c c 1 1 ggag 


ggtcagggcc 


aggtxgugaa ggucucugtu ttgccuaLLa 


CI Aft 


tgccctgacc 


acttacctac 


atgccaagca 


ccgcccaaga acctgtgccg gcagggcgca 


C A ft ft 

5400 


gtggctcaca 


cctgtaatcc 


ctgtaccttg 


ggaggccaag gcaggaggat cacttgaggc 


C ACf\ 
54o0 


caggagttcc 


agaccagcct 


gggcaaaata 


gtgagacccc tgtctctaca aaaaaaaaaa 


5520 


aaaaaaaaaa 


ttagccaggc 


atggcggtgc 


acgcacccac agccccaacu aaccgggaag 


55o0 


ctggcgggaa 


gactgcttga 


gcccagaagg 


ccgaggccgc agcgagccac gaccacugca 


RCA ft 


ccccagcccg 


agca a c agag 


caag accg tc 


UCCaaaaaaa aaCaaaaaaC aaaaaaaaaC 


C7ft ft 


ttgtgttaac 


gtgttaaact 


cgtttaatct 


ttacagtgat ttatgaggtg ggtactatta 


5760 


ttatccctat 


cttgatgata 


gggacagagt 


ggctaattag tatgcctgag atcacacagc 


5820 


tactgcagga 


ggctctcagg 


atttgaatcc 


acctggtcca tctggctcca gcatctatat 


5880 


gctttttttt 


ttgttggttt 


gtttttgaga 


cggac 


5915 



<210> 14 
<211> 5915 
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<212> DNA 

<213> Homo sapiens 




<400> 14 
caccatcaga 


tgggacgtct 


gtgaaggaga 


gagactgact 


gttgagttga 


tgcaagctca 


agaggttagg 


atactgtcaa 


gggtgtgtgt 


gagaaaggga 


gaccgaccac 


caggaagcac 


ctgcagcccg 


aatggtgcct 


gaaatagttt 


cgccgtattc 


gctggatccc 


ctgatccgct 


acaacaggac 


ttggcatagg 


gtaagcgcaa 


ttttcttttt 


tttttttgag 


acagagtctc 


acgatctcgg 


ctcactgcaa 


cctccggctc 


tcccgagtag 


ctgggattac 


aggcatgtgc 


gttgagatgg 


ggtttcacca 


tgttggcgag 


ccgccagcct 


cggcctccca 


aagtgctggg 


acagttttta 


aatctgtgga 


gacttcattt 


aactcccatc 


atgcctggca 


gccgctgggg 


actagtcccg 


gcattcttcg 


ctgttttcct 


agccatttgg 


gtcgtggagt 


gcgagcacgg 


ggggcgcggc 


cattcgccgc 


ccggcccctg 


cgggcggaac 


ctggagataa 


tgggcagcac 


tctttgcctg 


acgggcttag 


tgctctcgct 


ccgggaccgg 


gattaccgcg 


cgctctgcga 


cttctcctcc 


aggtgtgcac 


gggagtggga 


atgccagatg 


attattctgg 


agnccgggar 


gactgctcgc 


ggggtcgttg 


cacaggggct 


aagagtgcga 


ggcaagggac 


cagacagtgc 


cgtgaattgg 


atgccaagga 


ataacggtga 


agagattgac 


gatggtctca 


aggacggcgc 


cgcccaggaa 


caaggtggcc 


cggtctggct 


cggaatacag 


aggaggcgag 


tgagtggcca 


agataatctg 


cccccgactc 


ccagtctctg 



PCT/US2004/031481 



gacctcatct 


ggcccacagc 


ttggaaagga 


60 


ggtgttgcca 


ggcgggcgcc 


atgatagtag 


120 


ggccaaagga 


gtggttctgt 


gaatgtatgg 


1 Q A 

180 


tggtgaggca 


ggacccggga 


ggatgggagg 


O VI A 

«s40 


caggggaaat 


gcttggttcc 


cgaatcggat 


■5 n n 

300 


ggtctctagg 


tcccggatgc 


tgcaattctt 


"i aC A 

360 


atgctgttaa 


ccacactaac 


acactttttt 


420 


actctgtcgg 


cctggctgga 


gtgcagtggc 


480 


cccggctcaa 


gcaattctcc 


tgcctcagcc 


540 


caccacgccc 


ggctaatttt 


tgtattttta 


600 


gctggtcttg 


aactcctgac 


ctcaggtaat 


660 


attacaagcg 


tgagccaccg 


tgcccggcca 


720 


cccttgatgc 


cttgcagccg 


cgccgactac 


780 


ccgcgattcc 


gcacgtccct 


tacccgcttc 


840 


aactcgcccg 


cttgactagc 


gccctggaac 


AAA 
900 


ccggccaatc 


gccgagtcag 


agggccagga 


960 


ctccgtggct 


ggttttctcc 


gcgggcgcct 


1020 


ctgggggagc 


cctggctggg 


tgcggctcgc 


T A O A 
1080 


ctacgcgctg 


cacgcgaagg 


cggcgcgcgc 


1140 


cgcgggcacc 


gccatcagct 


gcxcgcgcgc 


1 OAA 


ggcgtggggc 


cxcggagcag 


ggcgg c c agg 


1 A, 


cggcgcgccc 


ggggaacgga 


c ci cy g y y c t g 




gagctaccca 


gcgacactgg 




noon 


tggggactgg 


gattattccg 


gggactcgca 


1440 


ccaggaaagg 


cggggaggca 


ggatggcggt 


1500 


gcaggtgaag 


gggggtgttg 


gcgatggctg 


1560 


gtgcgtgatg 


gccaggcgtt 


agcataatga 


1620 


gggagctgga 


gattctgggg 


tccagggcaa 


1680 


atgcaaaacc 


gagtgaaccg 


ttataccagc 


1740 
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cttgccattt 


taagaattac 


ttaagggccg 


ggcgcggtgg 


cccactcctg 


taatcccagc 


1800 


actttgggag 


gccgaggcgg 


atggatcact 


tgaagtcagg 


agttgaccag 


cctggccaac 


1860 


atggtgaaag 


cctgtctcta 


ccaaaaatag 


aaaaattaat 


cgggcgctat 


ggcgggtgcc 


1920 


ttaatcccag 


ctactcgggg 


gggctaaggc 


aggagaatcg 


cttgaacccg 


ggaggcggag 


1980 


gtttcagtga 


gccgagatcg 


cgccactgca 


ctccagcctg 


ggccagagtg 


agactccgtc 


2040 


tcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaag 


agacttactt 


aaggtctaag 


atgaaaagca 


2100 


gggcctacgg 


agtagccacg 


tccgggcctg 


gtctggggag 


aggggaggat 


agggtcagtg 


2160 


acatggaatc 


ctgacgtggc 


caaaggtgcc 


cggtgccagg 


agatcatcga 


cccttggact 


2220 


aggatgggag 


gtcggggaac 


agaggatagc 


ccaggtggct 


tcttggaaat 


cacctttctc 


2280 


gggcagggtc 


caaggcactg 


ggttgacagt 


cctaacctgg 


ttccacccca 


ccccacccct 


2340 


ctgccaggtg 


gggcaggggt 


ttcgggctgg 


tggagcatgt 


gctgggacag 


gacagcatcc 


2400 


tcaatcaatc 


caacagcata 


ttcggttgca 


tcttctacac 


actacagcta 


ttgttaggtg 


2460 


agtggctccg 


ccccctccct 


gcccgccccg 


ccccgcccct 


catccccctt 


ggtcagctca 


2520 


gccccactcc 


atgcaatctt 


ggtgatccac 


acagctgaca 


gccagctagc 


tgctcatcac 


2580 


ggagcgtcct 


gcgggtgggg 


atgtggggag 


gtaactaaca 


ggagtctttt 


aattggttta 


2640 


agtactgtta 


gaggctgaag 


ggcccttaaa 


gacatcctag 


gtccccaggt 


tttttgtttg 


2700 


ttgttgtttt 


gagacagggt 


ctggctctgt 


tgcccaaagt 


gaggtctagg 


atgcccttag 


2760 


tgtgcactgg 


cgtgatctca 


gttcatggca 


acctctgcct 


ccctgcccaa 


gggatcctcc 


2820 


caccttagcc 


tcccaagcag 


ctggaatcac 


aggcgtgcac 


cactatgccc 


agctaatttt 


2880 


tgtttttgtt 


tttttttggt 


agagatggtg 


tctcgccatg 


ttgcccaggc 


tggtctcaag 


2940 


caatctgtct 


gcctcagcct 


cccaaagtgc 


tggggggatt 


acaggcgtga 


gctaccatgc 


3000 


cccaccaaca 


ccccagtttt 


gtggaaaaga 


tgccgaaatt 


cctttttaag 


gagaagctga 


3060 


gcatgagcta 


tcttttgtct 


catttagtgc 


tcagcaggaa 


aatttgtatc 


tagtcccata 


3120 


agaacagaga 


gaggaaccaa 


gggagtggaa 


gacgatggcg 


ccccaggcct 


tgctgatgcc 


3180 


atatgccgga 


gatgagacta 


tccattacca 


cccttcccag 


caggctccca 


cgctcccttt 


3240 


gagtcaccct 


tcccagctcc 


agagaaggca 


tcactgaggg 


aggcccagca 


ccatggtcct 


3300 


ggctgacaca 


tggttcagac 


ttggccgatt 


tatttaagaa 


attttattgc 


tcagaacttt 


3360 


ccctccctgg 


gcaatggcaa 


gagcttcaga 


gaccagtccc 


ttggagggga 


cctgttgaag 


3420 


ccttcttttt 


tttttttttt 


aagaaataat 


cttgctctgt 


tgcccaggct 


ggagtgcagt 


3480 


ggcacaatca 


tagctcactg 


taacctggct 


caagcgatcc 


tcctgagtag 


ctaggactat 


3540 
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aggcatgtca 


ctgcacccag 


ctaauLtctt 




ttctttccct 


ttgegacata 


3o 00 


gtctcgctct 


gtcaccaggc 


eggagegcag 


4. ~— -a /VT3 t~ M 

eggcacg ate 


ctggctcact 


gcaacctctg 


3o oO 


cctcccgggt 


ccaagcaatt 


cccccgcccc 


agccccctyd 


y tagceggga 


c c acagg eg c 


J /ZU 


gtgtcaccac 


gcccagc uaa 




Ctcagtggag 


avagggvCuC 


accacgcugg 


nan 
J /ou 


ctaggatggt 


+m ^ +\ — * 4- a 4» a 4- 


cgacccggcg 


acccacccgt 


cttggccccc 


caaag cgc c a 




ggattacagg 


cgtgagtcaa 


ccccaccggg 




ccgagacgaa 


gtcttgetet 


jyuu 


tgctgcccaa 


gctggaatgt 


ggtggcatga 


t ctcggccca 


ctgcaacctc 


cacctcctag 


-5 o n 


gttcaagcga 


4« 4» m4» m m m m t- 

ttctccaccc 


tagcctcccc 


agcagctggg 


attacaggtg 


cccatcaaca 


4020 


cacccggcta 


atetcegtat 


4_4-4-4--.4-A.-im---. 

ttttatcaga 


~ — _____ +- 4- 

gaeggggecu 


ugccacgccg 


gccaggctgc 


a r\ 0 n 
40o0 


tctcgaactc 


ctaacctcag 


gtgacccacc 


cccaccggcc 


ccccaaaaca 


ccgggaccac 


4140 


agg cat gage 


caccgtgccc 


agecgaautx 


ccaaatLtcc 


gacagagacc 


gggtctttct 


4^0U 


atgttgccca 


„ _ _ t» — _ 4» m4- 4- 

agctggtctc 


gaactcctag 


cctaaagcag 


tcttcccacc 


tcggcctccc 


4260 


agagtgtttg 


gaatacgtgc 


gtaagccacc 


acatctgccc 


tggagectet 


4. mt4- 4- 4* 4- e«MTMM> 

cgcuuragag 


yl Tift 

4320 


acccttccca 


gcagctcctg 


gcatctaggt 


agtgcagtga 


catcatggag 


tgttcgggag 


4380 


gtggccagtg 


cctgaagccc 


acaccggacc 


4. M 4> A» A 4> M> M M 

ctcttctgcc 


ttgcaggttg 


cctgcggaca 


4440 


cgctgggcct 


M A. M.A_ m mAb — -k 

ctgtcctgat 


getgetgage 


tccctggtgt 


ctctcgctgg 


4b 4b m4> m>4- m 4. m m 

ttctgtctac 


4500 


ctggcctgga 


4b M m Ab m.Ab Ab M A- Ab 

tcctgttctt 


cgtgctctat 


gatttctgea 


1_ 4- +_ 4. 4- r- 4- 4- 

ttgtttgtat 


caccacctat 


a c c r\ 
456 0 


gctatcaacg 


tgagectgat 


gtggctcagt 


ttceggaagg 


tccaagaacc 


ccagggcaag 


A C *5 ft 

4620 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc 


ccacccgcuC 


tgctttggca 


A C O ft 

4680 


tgtgagcctt 


gectaagggg 


gcatatctgg 


gtccctagaa 


ggecctagat 


gtggggcttc 


/i n a ft 
4740 


tagattaccc 


cctcctcctg 


ccatacccac 


acatgacaat 


ggaccaaatg 


tgccacacgc 


VI O A A 

4800 


tcgctctttc 


ttacacccag 


tgcctctgac 


tctgtcccca 


kgggctggtc 


tccaaagctc 


A O C A 

4860 


4. 4- 4- mm — i 4- 1 i 

tttccactgc 


ccagggaggg 


aaggccccga 


gcaacaaagc 


4» 4b m 4b 4» m -5 4- rt 

ttct cagacc 


aaccagccaa 


A OO ft 

4720 


gtctgaacca 


*- m; f* /-r f- m: 4» ft r+r+ 

tgcgtcxgcc 


a l. gga ctyty 


gugccgggcc 


CCCCCCyyty 


4b 4b Mr m m 4- 4. m<4- m 

tcyccttccc 




tggagccggg 


aagggtgagu 


t«ciy ay yy cly cl 


g tggagggc c 


f- M. M 4. M-M>M- -J >S M> 

tgcugggaag 


M-M.4- M.M.4- 4. m 4. M> 

ggugguuacg 


Jul Vi 


ggtagtctca 


LCLCcagtgt 


gtggagccag 


c aagg c c t gg 


ggcaccauug 


gCCCCCaCCC 


jIUU 


ccaggaaaca 


ggctggcagc 


tcgcccctgc 


tgcccacagg 


agccaggccc 


4> 4» 4» fm^B 

ccccccccgg 


D JLO 0 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct 


ggttctgtaa 


atgcttgctg 


5220 


ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg 


ccttatcgac 


cattccaagc 


5280 


cagtattggt 


agecttggag 


ggtcagggee 


aggttgtgaa 


ggtttttgtt 


ttgectatta 


5340 


tgccctgacc 


acttacctac 


atgecaagea 


ctgtttaaga 


acttgtgttg 


geagggtgea 


5400 
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gtggctcaca 


cctgtaatcc 


ctgtactttg 


ggaggccaag gcaggaggat cacttgaggc 


5460 


caggagttcc agaccagcct 


gggcaaaata 


gtgagacccc tgtctctaca aaaaaaaaaa 


5520 


aaaaaaaaaa 


ttagccaggc 


atggtggtgt 


atgtacctat agtcccaact aatcgggaag 


5580 


ctggcgggaa gactgcttga 


gcccagaagg 


ttgaggctgc agtgagccat gatcactgca 


5640 


ctccagcctg agcaacagag 


caagaccgtc 


tccaaaaaaa aacaaaaaac aaaaaaaaac 


5700 


ttgtgttaac gtgttaaact 


cgtttaatct 


ttacagtgat ttatgaggtg ggtactatta 


5760 


ttatccctat 


cttgatgata 


gggacagagt 


ggctaattag tatgcctgag atcacacagc 


5820 


tactgcagga ggctctcagg 


atttgaatcc 


acctggtcca tctggctcca gcatctatat 


5880 


gctttttttt 


ttgttggttt 


gtttttgaga 


cggac 


5915 


<210> 15 

<211> 5915 

<212> DNA 

<213> Homo sapiens 








<400> 15 
caccatcaga 


tgggacgtct 


gtgaaggaga 


gacctcatct ggcccacagc ttggaaagga 


60 


gagactgact 


gttgagttga 


tgcaagctca 


ggtgttgcca ggcgggcgcc atgatagtag 


120 


agaggttagg 


atactgtcaa 


gggtgtgtgt 


ggccaaagga gtggttctgt gaatgtatgg 


180 


gagaaaggga 


gaccgaccac 


caggaagcac 


tggtgaggca ggacccggga ggatgggagg 


240 


ctgcagcccg 


aatggtgcct 


gaaatagttt 


caggggaaat gcttggttcc cgaatcggat 


300 


cgccgtattc 


gctggatccc 


ctgatccgct 


ggtctctagg tcccggatgc tgcaattctt 


360 


acaacaggac 


ttggcatagg 


gtaagcgcaa 


atgctgttaa ccacactaac acactttttt 


420 


ttttcttttt 


tttttttgag 


acagagtctc 


actctgtcgg cctggctgga gtgcagtggc 


480 


acgatctcgg 


ctcactgcaa 


cctccggctc 


cccggctcaa gcaattctcc tgcctcagcc 


540 


tcccgagtag 


ctgggattac 


agacatgtgc 


caccacgccc ggctaatttt tgtattttta 


600 


gttgagatgg 


ggtttcacca 


tgttggcgag 


gctggtcttg aactcctgac ctcaggtaat 


660 


ccgccagcct 


cggcctccca 


aagtgctggg 


attacaagcg tgagccaccg tgcccggcca 


720 


acagttttta 


aatctgtgga 


gacttcattt 


cccttgatgc cttgcagccg cgccgactac 


780 


aactcccatc 


atgcctggca 


gccgctgggg 


ccgcgattcc gcacgtccct tacccgcttc 


840 


actagtcccg 


gcattcttcg 


ctgttttcct 


aactcgcccg cttgactagc gccctggaac 


900 


agccatttgg 


gtcgtggagt 


gcgagcacgg 


ccggccaatc gccgagtcag agggccagga 


960 


ggggcgcggc 


cattcgccgc 


ccggcccctg 


ctccgtggct ggttttctcc gcgggcgcct 


1020 


cgggcggaac 


ctggagataa 


tgggcagcac 


ctgggggagc cctggctggg tgcggctcgc 


1080 
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tctttgcctg 


acgggcttag 


tgctctcgct 


ctacgcgctg 


cacgtgaagg 


cggcgcgcgc 


1140 


ccgggaccgg 


gattaccgcg 


cgctctgcga 


cgtgggcacc 


gccatcagct 


gttcgcgcgt 


1200 


cttctcctcc 


aggtgtgcac 


gggagtggga 


ggcgtggggc 


ctcggagcag 


ggcggccagg 


1260 


atgccagatg 


attattctgg 


agtctgggat 


cggtgtgccc 


ggggaacgga 


cacggggctg 


1320 


gactgctcgc 


ggggtcgttg 


cacaggggct 


gagctaccca 


gcgatactgg 


tgttcgaaat 


1380 


aagagtgcga 


ggcaagggac 


cagacagtgc 


tggggactgg 


gattattccg 


gggactcgca 


1440 


cgtgaattgg 


atgccaagga 


ataacggtga 


ccaggaaagg 


cggggaggca 


ggatggcggt 


1500 


agagattgac 


gatggtctca 


aggacggcgc 


gcaggtgaag 


gggggtgttg 


gcgatggctg 


1560 


cgcccaggaa 


caaggtggcc 


cggtctggct 


gtgcgtgatg 


gccaggcgtt 


agcataatga 


1620 


cggaatacag 


aggaggcgag 


tgagtggcca 


gggagctgga 


gattctgggg 


tccagggcaa 


1680 


agataatctg 


cccccgactc 


ccagtctctg 


atgcaaaacc 


gagtgaaccg 


ttataccagc 


1740 


cttgccattt 


taagaattac 


ttaagggccg 


ggcgcggtgg 


cccactcctg 


taatcccagc 


1800 


actttgggag 


gccgaggcgg 


atggatcact 


tgaagtcagg 


agttgaccag 


cctggccaac 


1860 


atggtgaaag 


cctgtctcta 


ccaaaaatag 


aaaaattaat 


cgggcgctat 


ggcgggtgcc 


1920 


ttaatcccag 


ctactcgggg 


gggctaaggc 


aggagaatcg 


cttgaacccg 


ggaggcggag 


1980 


gtttcagtga 


gccgagatcg 


cgccactgca 


ctccagcctg 


ggccagagtg 


agactccgtc 


2040 


tcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaag 


agacttactt 


aaggtctaag 


atgaaaagca 


2100 


gggcctacgg 


agtagccacg 


tccgggcctg 


gtctggggag 


aggggaggat 


agggtcagtg 


2160 


acatggaatc 


ctgacgtggc 


caaaggtgcc 


cggtgccagg 


agatcatcga 


cccttggact 


2220 


aggatgggag 


gtcggggaac 


agaggatagc 


ccaggtggct 


tcttggaaat 


cacctttctc 


2280 


gggcagggtc 


caaggcactg 


ggttgacagt 


cctaacctgg 


ttccacccca 


ccccacccct 


2340 


ctgccaggtg 


gggcaggggt 


ttcgggctgg 


tggagcatgt 


gctgggacag 


gacagcatcc 


2400 


tcaatcaatc 


caacagcata 


ttcggttgca 


tcttctacac 


actacagcta 


ttgttaggtg 


2460 


agtggctccg 


ccccctccct 


gcccgccccg 


ccccgcccct 


catccccctt 


ggtcagctca 


2520 


gccccactcc 


atgcaatctt 


ggtgatccac 


acagctgaca 


gccagctagc 


tgctcatcac 


2580 


ggagcgtcct 


gcgggtgggg 


atgtggggag 


gtaactaaca 


ggagtctttt 


aattggttta 


2640 


agtactgtta 


gaggctgaag 


ggcccttaaa 


gacatcctag 


gtccccaggt 


tttttgtttg 


2700 


ttgttgtttt 


gagacagggt 


ctggctctgt 


tgcccaaagt 


gaggtctagg 


atgcccttag 


2760 


tgtgcactgg 


cgtgatctca 


gttcatggca 


acctctgcct 


ccctgcccaa 


gggatcctcc 


2820 


caccttagcc 


tcccaagcag 


ctggaatcac 


aggcgtgcac 


cactatgccc 


agctaatttt 


2880 
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4-,~4_4_4_i_fc.._i_i_ 

tgtttttgtt 


4_ 4_ 4- 4- f- 4- 4- 


agagatggtg 


tctcgccacg 


ccgcccaggc 


tggtctcaag 


2940 


/-»—, -, 4- — 4_ — 4_ _ A_ 

caatctgtct 


gcctcagccc 


cccaaagtgc 


tggggggatt 


acaggcguga 


gctaccatgc 


3000 


cccaccaaca 


ccccagcttt 


g tggaaaag a 


cgccgaaacu 


nnf* 4- 4- 4- ^ *5 s^v 

ccwbtwLaag 


gagaagctga 


J OoO 


gcatgagcta 


ccccctgu.cu 


catccagtgc 


ucagcaggaa 


aaLLLyLaLC 


tagtcccata 


JX20 


agaa c ag ag a 


gaggaaccaa 


gggagtggaa 


gacgatggcg 


ccccaygcct 


tgetgatgee 


iloU 


atatgccgga 


gatgagacta 


tccattacca 


cccttcccag 


caggctccca 


cgctccctct 


3240 


gagtcaccct 


tcccagctcc 


agagaaggca 


tcactgaggg 


aggcccagca 


ccatggtcct 


3300 


ggctgacaca 


tggttcagac 


ttggccgatt 


fc> a- fc fc- 4. -» «k mm —i 

tatttaagaa 


attttattgc 


4> ~ — . a _ 4. 4_ 4_ 

tcagaacctt 


3360 


ccctccctgg 


gcaatggcaa 


gagcttcaga 


gaccagtccc 


ttggagggga 


cctgttgaag 


3420 


-4_4,fc4_4- 

ccttcttttt 


ttttttttcc 


aagaaataat 


— 4_ 4- M fc> 

cttgctctgu 


tgcccaggct 


ggagtgcagt 


■>/( Oft 

3480 


ggcacaatca 


tagctcactg 


taacctggcc 


caagcgatcc 


tcctgagtag 


ctaggactat 


3 540 


aggcatgtca 


ctgcacccag 


_4___4_4_4_4_4_4_ 

ctaatttttc 


4.4.4.4.4.^.4-4-4-4. 

tttttutttt 


4-4.4.4-4-4-4-4-4^4- 

ttcttttttc 


ttgegacata 


itftft 
3600 


gtctcgctct 


gtcaccaggc 


tggagtgcag 


tggcacgatc 


ttggctcact 


gcaacctctg 


3660 


cctcccgggt 


tcaagcaatt 


ttcctgcctc 


agcctcctga 


gtagctggga 


ctacaggcgc 


3720 


gtgtcaccac 


gcccagctaa 


tttttgtatt 


tttagtggag 


acagggtttc 


accatgttgg 


3780 


ctaggatggt 


ctcaatctct 


tgacctggtg 


atccatccgc 


cttggcctcc 


caaagtgeta 


3840 


ggattacagg 


cgtgagtcaa 


cctcaccggg 


.-.-14-4-4-4-4-4-4-4- 

catttttttt 


ttgagacgaa 


m<4- m4- &>m>m*« m 4- 

gtcttgetet 


3900 


tgctgcccaa 


gctggaatgt 


ggtggca tga 


tctcggctca 


ctgcaacctc 


cacctcctag 


3960 


gttcaagcga 


•4- 4- « 4- am -» m 4- 

ttccccacct 


tagcctcccc 


agcagctggg 


attacaggtg 


cccatcaaca 


4020 


cacccggcta 


atttttgtat 


fc. fc. i- 4- _» 4_ 4_ _» 

ttttattaga 


gatggggttt 


tgccatgttg 


gccaggctgc 


4080 


tctcgaactc 


ctaacctcag 


gtgatccacc 


cccattggcc 


tcccaaaata 


ctgggattac 


4140 


aggcatgagc 


caccgtgccc 


agctgaattt 


.__ fc.__-,_,fcfc»fc,fc>fci 

ctaaatxttt 


gatagagatc 


gggtctttct 


4200 


atgttgccca 


agctggtctt 


gaactcctag 


cctaaagcag 


tcttcccacc 


ccggcctccc 


4260 


agagtgtttg 


gaatacgtgc 


gtaagccacc 


acatctgccc 


tggagcctct 


4> M 4. fc> 4- fc> _v — 

tgttttagag 


4320 


acccttccca 


gcagctcctg 


gcacccaggc 


agtgcagtga 


cat ca cggag 


tgtcegggag 


A "3 o n 

43o0 


gtggccagtg 


cctgaagccc 


acaccggacc 


ctcttctgcc 


4- A* _m m *mmm 4m 4» 

l eg caggc eg 


cc t g egg a c a 


4440 


cgctgggcct 


ctgtcctgat 


gctgctgagc 


tccctggtgt 


ctctcgctgg 


fc> fc> -afc> x.4. 4- _ 

ttctgtctac 


4500 


ctggcctgga 


tcctgttctt 


cgtgctctat 


gatttctgca 


ttgtttgtat 


caccacctat 


4560 


gctatcaacg 


tgagcctgat 


gtggctcagt 


ttccggaagg 


tccaagaacc 


ccagggcaag 


4620 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc 


teatctgett 


tgctttggca 


4680 


tgtgagcctt 


gcctaagggg 


gcatatctgg 


gtccctagaa 


ggecctagat 


gtggggcttc 


4740 
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tagattaccc 


cctcctcctg 


ccatacccgc 


acatgacaat ggaccaaatg tgccacacgc 


4800 


tcgctctttt 


ttacacccag 


tgcctctgac 


tctgtcccca tgggctggtc tccaaagctc 


4860 


tttccattgc 


ccagggaggg 


aaggttctga 


gcaataaagt ttcttagatc aatcagccaa 


4920 


gtctgaacca 


tgtgtctgcc 


atggactgtg 


gtgctgggcc tccctcggtg ttgccttctc 


4980 


tggagctggg 


aagggtgagt 


cagagggaga 


gtggagggcc tgctgggaag ggtggttatg 


5040 


ggtagtctca 


tctccagtgt 


gtggagtcag 


caaggcctgg ggcaccattg gcccccaccc 


5100 


ccaggaaaca 


ggctggcagc 


tcgctcctgc 


tgcccacagg agccaggcct cctctcctgg 


5160 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct ggttctgtaa atgcttgctg 


5220 


ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg ccttatcgac cattccaagc 


5280 


cagtattggt 


agccttggag 


ggtcagggcc 


aggttgtgaa ggtttttgtt ttgcctatta 


5340 


tgccctgacc 


acttacctac 


atgccaagca 


ctgtttaaga acttgtgttg gcagggtgca 


5400 


gtggctcaca 


cctgtaatcc 


ctgtactttg 


ggaggccaag gcaggaggat cacttgaggc 


5460 


caggagttcc 


agaccagcct 


gggcaaaata 


gtgagacccc tgtctctaca aaaaaaaaaa 


5520 


aaaaaaaaaa 


ttagccaggc 


atggtggtgt 


atgtacctat agtcccaact aatcgggaag 


5580 


ctggcgggaa 


gactgcttga 


gcccagaagg 


ttgaggctgc agtgagccat gatcactgca 


5640 


ctccagcctg 


agcaacagag 


caagaccgtc 


tccaaaaaaa aacaaaaaac aaaaaaaaac 


5700 


ttgtgttaac 


gtgttaaact 


cgtttaatct 


ttacagtgat ttatgaggtg ggtactatta 


5760 


ttatccctat 


cttgatgata 


gggacagagt 


ggctaattag tatgcctgag atcacacagc 


5820 


tactgcagga 


ggctctcagg 


atttgaatcc 


acctggtcca tctggctcca gcatctatat 


5880 


gctttttttt 


ttgttggttt 


gtttttgaga 


cggac 


5915 



<210> 16 

<211> 5915 

<212> DNA 

<213> Homo sapiens 

<400> 16 

caccatcaga tgggacgtct gtgaaggaga gacctcatct ggcccacagc ttggaaagga 60 

gagactgact gttgagttga tgcaagctca ggtgttgcca ggcgggcgcc atgatagtag 120 

agaggttagg atactgtcaa gggtgtgtgt ggccaaagga gtggttctgt gaatgtatgg 180 

gagaaaggga gaccgaccac caggaagcac tggtgaggca ggacccggga ggatgggagg 240 

ctgcagcccg aatggtgcct gaaatagttt caggggaaat gcttggttcc cgaatcggat 300 

cgccgtattc gctggatccc ctgatccgct ggtctctagg tcccggatgc tgcaattctt 360 

acaacaggac ttggcatagg gtaagcgcaa atgctgttaa ccacactaac acactttttt 420 
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ttttcttttt 


tttttttgag 


acagagtctc actctgtcgg 


cctggctgga 


gtgcagtggc 


480 


acgatctcgg 


ctcactgcaa 


cctccggctc cccggctcaa 


gcaattctcc 


tgcctcagcc 


540 


tcccgagtag 


ctgggattac 


aggcatgtgc caccacgccc 


ggctaatttt 


tgtattttta 


600 


gttgagatgg 


ggtttcacca 


tgttggcgag gctggtcttg 


aactcctgac 


ctcaggtaat 


660 


ccgccagcct 


cggcctccca 


aagtgctggg attacaagcg 


tgagccaccg 


tgcccggcca 


720 


acagttttta 


aatctgtgga 


gacttcattt cccttgatgc 


cttgcagccg 


cgccgactac 


780 


aactcccatc 


atgcctggca 


gccgctgggg ccgcgattcc 


gcacgtccct 


tacccgcttc 


840 


actagtcccg 


gcattcttcg 


ctgttttcct aactcgcccg 


cttgactagc 


gccctggaac 


900 


agccatttgg 


gtcgtggagt 


gcgagcacgg ccggccaatc 


gccgagtcag 


agggccagga 


960 


ggggcgcggc 


cattcgccgc 


ccggcccctg ctccgtggct 


ggttttctcc 


gcgggcgcct 


1020 


cgggcggaac 


ctggagataa 


tgggcagcac ctgggggagc 


cctggctggg 


tgcggctcgc 


1080 


tctttgcctg 


acgggcttag 


tgctctcgct ctacgcgctg 


cacgtgaagg 


cggcgcgcgc 


1140 


ccgggaccgg 


gattaccgcg 


cgctctgcga cgtgggcacc 


gccatcagct 


gttcgcgcgt 


1200 


cttctcctcc 


aggtgtgcac 


gggagtggga ggcgtggggc 


ctcggagcag 


ggcggccagg 


1260 


atgccagatg 


attattctgg 


agtctgggat cggtgtgccc 


ggggaacgga 


cacggggctg 


1320 


gactgctcgc 


ggggtcgttg 


cacaggggct gagctaccca 


gcgatactgg 


tgttcgaaat 


1380 


aagagtgcga 


ggcaagggac 


cagacagtgc tggggactgg 


gattattccg 


gggactcgca 


1440 


cgtgaattgg 


atgccaagga 


ataacggtga ccaggaaagg 


cggggaggca 


ggatggcggt 


1500 


agagattgac 


gatggtctca 


aggacggcgc gcaggtgaag 


gggggtgttg 


gcgatggctg 


1560 


cgcccaggaa 


caaggtggcc 


cggtctggct gtgcgtgatg 


gccaggcgtt 


agcataatga 


1620 


cggaatacag 


aggaggcgag 


tgagtggcca gggagctgga 


gattctgggg 


tccagggcaa 


1680 


agataatctg 


cccccgactc 


ccagtctctg atgcaaaacc 


gagtgaaccg 


ttataccagc 


1740 


cttgccattt 


taagaattac 


ttaagggccg ggcgcggtgg 


cccactcctg 


taatcccagc 


1800 


actttgggag 


gccgaggcgg 


atggatcact tgaagtcagg 


agttgaccag 


cctggccaac 


1860 


atggtgaaag 


cctgtctcta 


ccaaaaatag aaaaattaat 


cgggcgctat 


ggcgggtgcc 


1920 


ttaatcccag 


ctactcgggg 


gggctaaggc aggagaatcg 


cttgaacccg 


ggaggcggag 


1980 


gtttcagtga 


gccgagatcg 


cgccactgca ctccagcctg 


ggccagagtg 


agactccgtc 


2040 


tcaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaag agacttactt 


aaggtctaag 


atgaaaagca 


2100 


gggcctacgg 


agtagccacg 


tccgggcctg gtctggggag 


aggggaggat 


agggtcagtg 


2160 


acatggaatc 


ctgacgtggc 


caaaggtgcc cggtgccagg 


agatcatcga 


cccttggact 


2220 
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aggatgggag gtcggggaac agaggatagc ccaggtggct tcttggaaat cacctttctc 2280 

gggcagggtc caaggcactg ggttgacagt cctaacctgg ttccacccca ccccacccct 2340 

ctgccaggtg gggcaggggt ttcgggctgg tggagcatgt gctgggacag gacagcatcc 2400 

tcaatcaatc caacagcata- ttcggttgca tcttctacac actacagcta ttgttaggtg 2460 

agtggctccg ccccctccct gcccgccccg ccccgcccct catccccctt ggtcagctca 2520 

gccccactcc atgcaatctt ggtgatccac acagctgaca gccagctagc tgctcatcac 2580 

ggagcgtcct gcgggtgggg atgtggggag gtaactaaca ggagtctttt aattggttta 2640 

agtactgtta gaggctgaag ggcccttaaa gacatcctag gtccccaggt tttttgtttg 2700 

ttgttgtttt gagacagggt ctggctctgt tgcccaaagt gaggtctagg atgcccttag 2760 

tgtgcactgg cgtgatctca gttcatggca acctctgcct ccctgcccaa gggatcctcc 2820 

caccttagcc tcccaagcag ctggaatcac aggcgtgcac cactatgccc agctaatttt 2880 

tgtttttgtt tttttttggt agagatggtg tctcgccatg ttgcccaggc tggtctcaag 2940 

caatctgtct gcctcagcct cccaaagtgc tggggggatt acaggcgtga gctaccatgc 3000 

cccaccaaca ccccagtttt gtggaaaaga tgccgaaatt cctttttaag gagaagctga 3060 

gcatgagcta tcttttgtct catttagtgc tcagcaggaa aatttgtatc tagtcccata 3120 

agaacagaga gaggaaccaa gggagtggaa gacgatggcg ccccaggcct tgctgatgcc 3180 

atatgccgga gatgagacta tccattacca cccttcccag caggctccca cgctcccttt 3240 

gagtcaccct tcccagctcc agagaaggca tcactgaggg aggcccagca ccatggtcct 3300 

ggctgacaca tggttcagac ttggccgatt tatttaagaa attttattgc tcagaacttt 3360 

ccctccctgg gcaatggcaa gagcttcaga gaccagtccc ttggagggga cctgttgaag 3420 

ccttcttttt tttttttttt aagaaataat cttgctctgt tgcccaggct ggagtgcagt 3480 

ggcacaatca tagctcactg taacctggct caagcgatcc tcctgagtag ctaggactat 3540 

aggcatgtca ctgcacccag ctaatttttt tttttttttt tttttttttt ttgcgacata * 3600 

gtctcgctct gtcaccaggc tggagtgcag tggcacgatc ttggctcact gcaacctctg 3660 

cctcccgggt tcaagcaatt ttcctgcctc agcctcctga gtagctggga ctacaggcgc 3720 

gtgtcaccac gcccagctaa tttttgtatt tttagtggag acagggtttc accatgttgg 3780 

ctaggatggt ctcaatctct tgacctggtg atccatccgc cttggcctcc caaagtgcta 3840 

ggattacagg cgtgagtcaa cctcaccggg catttttttt ttgagacgaa gtcttgctct 3900 

tgctgcccaa gctggaatgt ggtggcatga tctcggctca ctgcaacctc cacctcctag 3960 

gttcaagcga ttctccacct tagcctcccc agcagctggg attacaggtg cccatcaaca 4020 

cacccggcta atttttgtat ttttattaga gatggggttt tgccatgttg gccaggctgc 4080 
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tctcgaactc 


ctaacctcag 


gtgatccacc 


cccattggcc 


tcccaaaata 


ctgggattac 


4140 


aggcatgagc 


caccgtgccc 


agctgaattt 


ctaaattttt 


gatagagatc 


gggtctttct 


4200 


atgttgccca 


agctggtctt 


gaactcctag 


cctaaagcag 


tcttcccacc 


tcggcctccc 


4260 


agagtgtttg 


gaatacgtgc 


gtaagccacc 


acatctgccc 


tggagcctct 


tgttttagag 


4320 


acccttccca 


gcagctcctg 


gcatctaggt 


agtgcagtga 


catcatggag 


tgttcgggag 


4380 


gtggccagtg 


cctgaagccc 


acaccggacc 


ctcttctgcc 


ttgcaggttg 


cctgcggaca 


4440 


cgctgggcct 


ctgtcctgat 


gctgctgagc 


tccctggtgt 


ctctcgctgg 


ttctgtctac 


4500 


ttggcctgga 


tcctgttctt 


cgtgctctat 


gatttctgca 


ttgtttgtat 


caccacctat 


4560 


gctatcaacg 


tgagcctgat 


gtggctcagt 


ttccggaagg 


tccaagaacc 


ccagggcaag 


4620 


gctaagaggc 


actgagccct 


caacccaagc 


caggctgacc 


tcatctgctt 


tgctttggca 


4680 


tgtgagcctt 


gcctaagggg 


gcatatctgg 


gtccctagaa 


ggccctagat 


gtggggcttc 


4740 


tagattaccc 


cctcctcctg 


ccatacccgc 


acatgacaat 


ggaccaaatg 


tgccacacgc 


4800 


tcgctctttt 


ttacacccag 


tgcctctgac 


tctgtcccca 


tgggctggtc 


tccaaagctc 


4860 


tttccattgc 


ccagggaggg 


aaggttctga 


gcaataaagt 


ttcttagatc 


aatcagccaa 


4920 


gtctgaacca 


tgtgtctgcc 


atggactgtg 


gtgctgggcc 


tccctcggtg 


ttgccttctc 


4980 


tggagctggg 


aagggtgagt 


cagagggaga 


gtggagggcc 


tgctgggaag 


ggtggttatg 


5040 


ggtagtctca 


tctccagtgt 


gtggagtcag 


caaggcctgg 


ggcaccattg 


gcccccaccc 


5100 


ccaggaaaca 


ggctggcagc 


tcgctcctgc 


tgcccacagg 


agccaggcct 


cctctcctgg 


5160 


gaaggctgag 


cacacacctg 


gaagggcagg 


ctgcccttct 


ggttctgtaa 


atgcttgctg 


5220 


ggaagttctt 


ccttgagttt 


aactttaacc 


cctccagttg 


ccttatcgac 


cattccaagc 


5280 


cagtattggt 


agccttggag 


ggtcagggcc 


aggttgtgaa 


ggtttttgtt 


ttgcctatta 


5340 


tgccctgacc 


acttacctac 


atgccaagca 


ctgtttaaga 


acttgtgttg 


gcagggtgca 


5400 


gtggctcaca 


cctgtaatcc 


ctgtactttg 


ggaggccaag 


gcaggaggat 


cacttgaggc 


5460 


caggagttcc 


agaccagcct 


gggcaaaata 


gtgagacccc 


tgtctctaca 


aaaaaaaaaa 


5520 


aaaaaaaaaa 


ttagccaggc 


atggtggtgt 


atgtacctat 


agtcccaact 


aatcgggaag 


^580 


ctggcgggaa 


gactgcttga 


gcccagaagg 


ttgaggctgc 


agtgagccat 


gatcactgca 


5640 


ctccagcctg 


agcaacagag 


caagaccgtc 


tccaaaaaaa 


aacaaaaaac 


aaaaaaaaac 


5700 


ttgtgttaac 


gtgttaaact 


cgtttaatct 


ttacagtgat 


ttatgaggtg 


ggtactatta 


5760 


ttatccctat 


cttgatgata 


gggacagagt 


ggctaattag 


tatgcctgag 


atcacacagc 


5820 


tactgcagga 


ggctctcagg 


atttgaatcc 


acctggtcca 


tctggctcca 


gcatctatat 


5880 
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gctttttttt ttgttggttt gtttttgaga cggac 



5915 



<210> 17 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> vk258l G>C VIC probe sequence 

<400> 17 

tcatcacgga gcgtc 15 



<210> 18 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> vk2581 G>C FAM probe sequence 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 19 

ggtgatccac acagctgaca 20 



<210> 20 

<211> 23 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 



<210> 21 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> vk3294 T>C VIC probe sequence 

<400> 21 

ccaggaccat ggtgc 15 



<400> 18 
tcatcaccga gcgtc 



15 



<400> 20 

cctgttagtt acctccccac ate 



23 
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<210> 22 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> vk3294 T>C FAM probe sequence 

<400> 22 
ccaggaccgt ggtgc 



<210> 23 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 23 

gctccagaga aggcatcact 20 



<210> 24 

<211> 22 

<212> DNA 

<213> Artificial 



<210> 25 

<211> 15 

<212> DNA 

<213> Artificial 

<220> 

<223> vk4 769 G>A VIC probe sequence 

<400> 25 

atacccgcac atgac 15 



<210> 26 

<211> 16 

<212> DNA 

<213> Artificial 

<220> 

<223> vk4769 G>A FAM probe sequence 



<220> 
<223> 



PCR primer 



<400> 24 

gccaagtctg aaccatgtgt ca 



22 



<400> 26 
catacccaca catgac 



16 
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<210> 27 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 27 

gfcccctagaa ggccctagat gt 22 



<210> 28 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 



<210> 29 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 29 

ccaatcgccg agtcagagg 19 



<210>. 30 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 



<210> 31 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 31 

aggggaggat agggtcagtg 20 



<400> 28 

gtgtggcaca tttggtccat t 



21 



<400> 30 

cccagtcccc agcactgtct 



20 
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<210> 32 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 32 

cctgttagtt acctccccac a 21 



<210> 33 

<211> 20 

<212> DNA 

<213> Artificial 



<220> 

<223> PCR primer 
<400> 33 

atacgtgcgt aagccaccac 20 



<210> 34 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> PCR primer 

<400> 34 

acccagatat gcccccttag 20 
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